Journal: IEEE Transactions on Audio, Speech, and Language Processing

Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition

Abstract

This paper proposes a novel one-pass search algorithm with on-the-fly composition of weighted finite-state transducers (WFSTs) for large-vocabulary continuous-speech recognition. In the standard search method with on-the-fly composition, two or more WFSTs are composed during decoding, and a Viterbi search is performed based on the composed search space. With this new method, a Viterbi search is performed based on the first of the two WFSTs. The second WFST is only used to rescore the hypotheses generated during the search. Since this rescoring is very efficient, the total amount of computation required by the new method is almost the same as when using only the first WFST. In a 65k-word vocabulary spontaneous lecture speech transcription task, our proposed method significantly outperformed the standard search method. Furthermore, our method was faster than decoding with a single fully composed and optimized WFST while using only 38% of the memory required for decoding with the single WFST. Finally, we have achieved high-accuracy one-pass real-time speech recognition with an extremely large vocabulary of 1.8 million words.
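
The idea of searching over the first WFST while using the second only to rescore can be illustrated with a toy sketch. This is not the paper's implementation: the dictionary-based transducer encoding, the function name `decode`, and the example weights are all assumptions made for illustration, and the sketch omits acoustic scores, epsilon inputs, and pruning. Each state of the first transducer keeps a small table of co-hypotheses, one per state of the second transducer, and the second transducer is consulted only when a transition emits a word.

```python
from collections import defaultdict

EPS = None  # output label meaning "no word emitted" (illustrative convention)

def decode(first, second, start_a, start_b, finals_a, inputs):
    """Viterbi search over `first`; `second` only rescores emitted words.

    first:  {state: [(in_label, out_word_or_EPS, cost, next_state), ...]}
    second: {(b_state, word): (cost, next_b_state)}  (deterministic)
    Returns (best_cost, words), or None if no final state is reached.
    """
    # tokens[a_state][b_state] = (accumulated cost, emitted words)
    tokens = {start_a: {start_b: (0.0, ())}}
    for sym in inputs:
        new_tokens = defaultdict(dict)
        for a, cohyps in tokens.items():
            for in_label, word, cost_a, a_next in first.get(a, []):
                if in_label != sym:
                    continue
                for b, (cost, words) in cohyps.items():
                    new_cost, b_next, new_words = cost + cost_a, b, words
                    if word is not EPS:
                        # Rescoring step: advance the second transducer.
                        if (b, word) not in second:
                            continue
                        cost_b, b_next = second[(b, word)]
                        new_cost += cost_b
                        new_words = words + (word,)
                    old = new_tokens[a_next].get(b_next)
                    if old is None or new_cost < old[0]:
                        new_tokens[a_next][b_next] = (new_cost, new_words)
        tokens = dict(new_tokens)
    best = None
    for a, cohyps in tokens.items():
        if a in finals_a:
            for cost, words in cohyps.values():
                if best is None or cost < best[0]:
                    best = (cost, words)
    return best

# Hypothetical toy data: the first transducer alone prefers "x z".
first = {
    0: [("a", "x", 1.0, 1), ("a", "y", 2.0, 1)],
    1: [("b", "z", 0.5, 2)],
}
# The second transducer penalizes "z" after "x" but not after "y".
second = {
    (0, "x"): (0.0, "x"),
    (0, "y"): (0.0, "y"),
    ("x", "z"): (3.0, "z"),
    ("y", "z"): (0.0, "z"),
}
result = decode(first, second, 0, 0, {2}, ["a", "b"])
print(result)
```

In this toy example the first transducer on its own would choose "x z" at cost 1.5, but the rescored search returns "y z" at cost 2.5, because the second transducer adds a penalty of 3.0 to "z" following "x".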