首页> 外文会议>International Conference on Spoken Language Processing >Fast On-The-Fly Composition for Weighted Finite-State Transducers in 1.8 Million-Word Vocabulary Continuous Speech Recognition
【24h】

Fast On-The-Fly Composition for Weighted Finite-State Transducers in 1.8 Million-Word Vocabulary Continuous Speech Recognition

机译:快速的有限状态传感器的快速开启组合物,在180万字词汇连续语音识别中

获取原文

摘要

This paper proposes a new on-the-fly composition algorithm for Weighted Finite-State Transducers (WFSTs) in large-vocabulary continuous-speech recognition. In general on-the-fly composition, two transducers are composed during decoding, and a Viterbi search is performed based on the composed search space. In this new method, a Viterbi search is performed based on the first of two transducers. The second transducer is only used to rescore the hypotheses generated during the search. Since this rescoring is very efficient, the total amount of computation in the new method is almost the same as when using only the first transducer. In a 30k-word vocabulary spontaneous lecture speech transcription task, our proposed method significantly outperformed the general on-the-fly composition method. Furthermore the speed of our method was slightly faster than that of decoding with a single fully composed and optimized WFST, where our method consumed only 20% of the memory usage required for decoding with the single WFST. Finally, we have achieved one-pass real-time speech recognition in an extremely large vocabulary of 1.8 million words.
机译:本文提出了一种新的大词汇连续语音识别中加权有限状态传感器(WFST)的新型开发组合算法。通常,在------------unstoce中,在解码期间组合两个换能器,并且基于组合的搜索空间执行维特比搜索。在这种新方法中,基于两个传感器中的第一个来执行维特比搜索。第二换能器仅用于重置在搜索期间生成的假设。由于该备用非常有效,因此新方法中的总计算量几乎与仅使用第一换能器时的计算。在30k字词汇自发性讲义语音转录任务中,我们所提出的方法显着优于一般的逐个组成方法。此外,通过单个完全组成和优化的WFST进行解码,我们的方法的速度略微速度略微快,其中,我们的方法仅消耗了用单个WFST解码所需的20%的内存使用量。最后,我们在180万字的极大词汇表中取得了一次通过的实时语音识别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号