首页> 外文会议>Annual conference of the International Speech Communication Association >Efficient On-The-Fly Hypothesis Rescoring in a Hybrid GPU/CPU-based Large Vocabulary Continuous Speech Recognition Engine
【24h】

Efficient On-The-Fly Hypothesis Rescoring in a Hybrid GPU/CPU-based Large Vocabulary Continuous Speech Recognition Engine

机译:基于混合GPU / CPU的大词汇量连续语音识别引擎中的有效即时假设记录

获取原文
获取外文期刊封面目录资料

摘要

Effectively exploiting the resources available on modern mul-ticore and manycore processors for tasks such as large vocabulary continuous speech recognition (LVCSR) is far from trivial. While prior works have demonstrated the effectiveness of manycore graphic processing units (GPU) for high-throughput, limited vocabulary speech recognition, they are unsuitable for recognition with large acoustic and language models due to the limited 1-6GB of memory on GPUs. To overcome this limitation, we introduce a novel architecture for WFST-based LVCSR that jointly leverages manycore graphic processing units (GPU) and multicore processors (CPU) to efficiently perform recognition even when large acoustic and language models are applied. In the proposed approach, recognition is performed on the GPU using an H-level WFST, composed using a unigram language model. During decoding partial hypotheses generated over this network are rescored on-the-fly using a large language model, which resides on the CPU. By maintaining N-best hypotheses during decoding our proposed architecture obtains comparable accuracy to a standard CPU-based WFST decoder while improving decoding speed by a factor of 11 ×.
机译:有效地利用现代多核和许多核处理器上的资源来完成诸如大词汇量连续语音识别(LVCSR)之类的任务并非易事。尽管先前的工作已经证明了许多核心图形处理单元(GPU)对于高吞吐量,有限的词汇语音识别的有效性,但由于GPU上的1-6GB内存有限,因此它们不适用于大型声学和语言模型。为克服此限制,我们为基于WFST的LVCSR引入了一种新颖的体系结构,该体系结构联合利用多核图形处理单元(GPU)和多核处理器(CPU)来有效地执行识别,即使在应用大型声学和语言模型时也是如此。在提出的方法中,使用H-level WFST在GPU上执行识别,该WFST由unigram语言模型组成。在解码过程中,使用驻留在CPU上的大型语言模型实时重述通过该网络生成的部分假设。通过在解码过程中保持N个最佳假设,我们提出的体系结构可以获得与基于标准CPU的WFST解码器相当的准确性,同时将解码速度提高了11倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号