Silence is golden: Modeling non-speech events in WFST-based dynamic network decoders

Abstract

Models for silence are a fundamental part of continuous speech recognition systems. Depending on application requirements, audio data segmentation, and the availability of detailed training data annotations, it may be necessary or beneficial to distinguish further non-speech events, for example breath and background noise. Integrating multiple non-speech models into a WFST-based dynamic network decoder is not straightforward, because these models do not fit perfectly into the transducer framework. This paper describes several options for constructing the transducer with multiple non-speech models, shows their considerably different characteristics in memory and runtime efficiency, and analyzes the impact on recognition performance.

Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012, pp. 4205-4208 (4 pages)
Conference location: Kyoto (JP)
Authors
Rybach, David

Author affiliation
Human Language Technology and Pattern Recognition, Computer Science Department, RWTH Aachen University, 52056 Aachen, Germany
Original format: PDF
