Fast On-The-Fly Composition for Weighted Finite-State Transducers in 1.8 Million-Word Vocabulary Continuous Speech Recognition

机译：快速的有限状态传感器的快速开启组合物，在180万字词汇连续语音识别中

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a new on-the-fly composition algorithm for Weighted Finite-State Transducers (WFSTs) in large-vocabulary continuous-speech recognition. In general on-the-fly composition, two transducers are composed during decoding, and a Viterbi search is performed based on the composed search space. In this new method, a Viterbi search is performed based on the first of two transducers. The second transducer is only used to rescore the hypotheses generated during the search. Since this rescoring is very efficient, the total amount of computation in the new method is almost the same as when using only the first transducer. In a 30k-word vocabulary spontaneous lecture speech transcription task, our proposed method significantly outperformed the general on-the-fly composition method. Furthermore the speed of our method was slightly faster than that of decoding with a single fully composed and optimized WFST, where our method consumed only 20% of the memory usage required for decoding with the single WFST. Finally, we have achieved one-pass real-time speech recognition in an extremely large vocabulary of 1.8 million words.

机译：本文提出了一种新的大词汇连续语音识别中加权有限状态传感器（WFST）的新型开发组合算法。通常，在------------unstoce中，在解码期间组合两个换能器，并且基于组合的搜索空间执行维特比搜索。在这种新方法中，基于两个传感器中的第一个来执行维特比搜索。第二换能器仅用于重置在搜索期间生成的假设。由于该备用非常有效，因此新方法中的总计算量几乎与仅使用第一换能器时的计算。在30k字词汇自发性讲义语音转录任务中，我们所提出的方法显着优于一般的逐个组成方法。此外，通过单个完全组成和优化的WFST进行解码，我们的方法的速度略微速度略微快，其中，我们的方法仅消耗了用单个WFST解码所需的20％的内存使用量。最后，我们在180万字的极大词汇表中取得了一次通过的实时语音识别。

著录项

来源
《International Conference on Spoken Language Processing》|2004年||共4页
会议地点
作者
Takaaki Hori; Chiori Hori; Yasuhiro Minami; International Speech Communication Association;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类应用语言学;
关键词

相似文献

外文文献
中文文献
专利

1. Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition [J] . Hori T., Hori C., Minami Y., IEEE transactions on audio, speech and language processing . 2007,第4期

机译：高效的基于WFST的单遍解码，具有即时假设，可极大地记录词汇量，并能连续语音识别
2. Structural Classification Methods Based on Weighted Finite-State Transducers for Automatic Speech Recognition [J] . Kubo Y., Watanabe S., Hori T., Audio, Speech, and Language Processing, IEEE Transactions on . 2012,第8期

机译：基于加权有限状态传感器的语音识别结构分类方法
3. Learning a Discriminative Weighted Finite-State Transducer for Speech Recognition [J] . Lehr M., Shafran I. Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第5期

机译：学习用于语音识别的判别加权有限状态传感器
4. Fast On-The-Fly Composition for Weighted Finite-State Transducers in 1.8 Million-Word Vocabulary Continuous Speech Recognition [C] . Takaaki Hori, Chiori Hori, Yasuhiro Minami International Conference on Spoken Language Processing; 20041004-08; Jeju(KR) . 2004

机译：180万词词汇连续语音识别中加权有限状态传感器的快速动态组成
5. Flexible speech synthesis using weighted finite-state transducers. [D] . Bulyko, Ivan. 2002

机译：使用加权有限状态换能器的灵活语音合成。
6. Recognition of time-compressed speech does not predict recognition of natural fast-rate speech by older listeners [O] . Sandra Gordon-Salant, Danielle J. Zion, Carol Espy-Wilson -1

机译：时间压缩语音的识别无法预测年长听众对自然快速语音的识别
7. Large Vocabulary Continuous Speech Recognition Using Weighted Finite-State Transducers [O] . Diamantino Caseiro, Isabel Trancoso 2002

机译：使用加权有限状态转换器的大词汇量连续语音识别

Fast On-The-Fly Composition for Weighted Finite-State Transducers in 1.8 Million-Word Vocabulary Continuous Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅