Acoustic Look-Ahead for More Efficient Decoding in LVCSR

机译：在LVCSR中进行声学前瞻以实现更高效的解码

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we propose novel approximations of a generalized acoustic look-ahead to speed up the search process in large vocabulary continuous speech recognition (LVCSR). Unlike earlier methods, we do not employ any phoneme- or syllable level heuristics. First we define and analyze the perfect acoustic look-ahead as a simple pre-evaluation of the original acoustic models into the future. This method is very slow, but reveals the best possible impact on the search space that can be achieved through acoustic look-ahead. In a second step, we derive efficient and simple approximative look-ahead models from the perfect models. We show that the approximative models compare well to the perfect models regarding the search space, and that the approximative models significantly improve the efficiency in comparison to the baseline, without any negative effect on the precision.

机译：在本文中，我们提出了一种通用的声学预见的新颖近似方法，以加快大词汇量连续语音识别（LVCSR）的搜索过程。与以前的方法不同，我们不使用任何音素或音节水平试探法。首先，我们定义并分析理想的声学前瞻，作为对未来声学模型的简单预评估。此方法非常慢，但是可以通过声学预视来获得对搜索空间的最佳影响。第二步，我们从完美模型中得出有效且简单的近似预见模型。我们表明，对于搜索空间，逼近模型与完美模型具有很好的比较，并且与基线相比，逼近模型显着提高了效率，而对精度没有任何负面影响。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2011》|2011年|p.900-903|共4页
会议地点
作者
D. Nolden; R. Schlueter; H. Ney;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
speech recognition; search; acoustic look-ahead; efficiency;

机译：语音识别;搜索;声音超前效率;

相似文献

外文文献
中文文献
专利

1. Hardware Efficient and Low Latency Implementations of Look-Ahead ACS Computation for Viterbi Decoders [J] . Kazuhito ITO, Ryoto SHIRASAKA IEICE Transactions on fundamentals of electronics, communications & computer sciences . 2013,第12期

机译：Viterbi解码器的前瞻ACS计算的硬件高效和低延迟实现
2. Intra-dance variation among waggle runs and the design of efficient protocols for honey bee dance decoding Intra-dance variation among waggle runs and the design of efficient protocols for honey bee dance decoding Intra-dance variation among waggle runs and the design of efficient protocols for honey bee dance decoding [J] . Amanda M. Kuepfer, Elisabeth L. Harris-Jones, Francis L. W. Ratnieks, Biology Open . 2012,第5期

机译：摇摆游走之间的舞间变异和蜂舞解码的有效协议的设计摇摆游走之间的舞间变异和蜂舞解码的有效协议的设计摇摆跑之间的舞间变异和蜂舞的有效协议的设计蜜蜂舞解码
3. Training Deep Bidirectional LSTM Acoustic Model for LVCSR by a Context-Sensitive-Chunk BPTT Approach [J] . Kai Chen, Qiang Huo Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第7期

机译：通过上下文敏感块BPTT方法训练LVCSR的深度双向LSTM声学模型
4. Advanced search space pruning with acoustic look-ahead for WFST based LVCSR [C] . Nolden David, Schluter Ralf, Ney Hermann IEEE International Conference on Acoustics, Speech and Signal Processing . 2013

机译：基于WFST的LVCSR的声学前瞻性高级搜索空间修剪
5. Search and decoding strategies for complex lexical modeling in LVCSR [D] . Deoras, Anoop 2011

机译：LVCSR中复杂词法建模的搜索和解码策略
6. Decoding spatial attention with EEG and virtual acoustic space [O] . Yue Dong, Kaan E. Raif, Sarah C. Determan, 2017

机译：使用EEG和虚拟声学空间解码空间注意力
7. Acoustic Look-Ahead for More Efficient Decoding in LVCSR [O] . Nolden David, Schlüter Ralf, Ney Hermann 2011

机译：LVCSR中的声学前瞻可实现更高效的解码

Acoustic Look-Ahead for More Efficient Decoding in LVCSR

摘要

著录项

相似文献

相关主题

期刊订阅