Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding

Lecouteux B.; Linares G.; Esteve Y.; Gravier G.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding

【24h】

Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding

机译：通过驱动解码实现自动语音识别系统的动态组合

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Combining automatic speech recognition (ASR) systems generally relies on the posterior merging of the outputs or on acoustic cross-adaptation. In this paper, we propose an integrated approach where outputs of secondary systems are integrated in the search algorithm of a primary one. In this driven decoding algorithm (DDA), the secondary systems are viewed as observation sources that should be evaluated and combined to others by a primary search algorithm. DDA is evaluated on a subset of the ESTER I corpus consisting of 4 hours of French radio broadcast news. Results demonstrate DDA significantly outperforms vote-based approaches: we obtain an improvement of 14.5% relative word error rate over the best single-systems, as opposed to the the 6.7% with a ROVER combination. An in-depth analysis of the DDA shows its ability to improve robustness (gains are greater in adverse conditions) and a relatively low dependency on the search algorithm. The application of DDA to both and beam-search-based decoder yields similar performances.

机译：组合自动语音识别（ASR）系统通常依赖于输出的后合并或声学交叉适应。在本文中，我们提出了一种集成方法，其中将次级系统的输出集成到初级系统的搜索算法中。在此驱动解码算法（DDA）中，辅助系统被视为观察源，应通过主要搜索算法对其进行评估并与其他系统组合。 DDA是在ESTER I语料库的一个子集中进行评估的，该子集包含4个小时的法国广播新闻。结果表明，DDA明显优于基于投票的方法：相对于最佳的单系统，相对单词错误率提高了14.5％，而使用ROVER组合则为6.7％。对DDA的深入分析表明，它具有提高鲁棒性的能力（在不利条件下收益更大），并且对搜索算法的依赖性相对较低。 DDA在基于波束搜索的解码器和基于波束搜索的解码器上的应用都具有相似的性能。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2013年第6期|1251-1260|共10页
作者
Lecouteux B.; Linares G.; Esteve Y.; Gravier G.;
展开▼
作者单位

GETALP Team, Univ. of Grenoble Alpes, Grenoble, France|c|;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Automatic speech recognition; speech processing; system combination;

机译：自动语音识别;语音处理;系统组合;

相似文献

外文文献
中文文献
专利

1. Auditory driven subband speech enhancement for automatic recognition of noisy speech [J] . Navneet Upadhyay, Hamurabi Gamboa Rosales International journal of speech technology . 2016,第4期

机译：听觉驱动的子带语音增强功能可自动识别嘈杂的语音
2. Information-theoretic analysis of efficiency of the phonetic encoding-decoding method in automatic speech recognition [J] . Savchenko V. V., Savchenko A. V. NTT R&D . 2016,第4期

机译：语音语音识别中语音编解码方法效率的信息论分析
3. Corrections to "Segmental minimum Bayes-risk decoding for automatic speech recognition" [J] . Goel V., Kumar S., Byrne W. IEEE transactions on audio, speech and language processing . 2006,第1期

机译：对“用于自动语音识别的分段最小贝叶斯风险解码”的更正
4. Generalized driven decoding for speech recognition system combination [C] . Lecouteux, B., Linares, Personal, Indoor and Mobile Radio Communications,2005 IEEE 16th International Symposium on . 2008

机译：语音识别系统组合的通用驱动解码
5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques. [D] . Singh, Amriteshwar. 2011

机译：一种使用光学字符识别（OCR）和自动语音识别（ASR）技术的自动邮政地址识别系统的多模式融合方法。
6. A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech [O] . Jodi Kodish-Wachs, Emin Agassi, Patrick Kenny III, 2018

机译：当代自动语音识别引擎用于对话式临床语音的系统比较
7. Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding [O] . Benjamin Lecouteux, Georges Linares, Yannick Estève, 2014

机译：基于驱动解码的自动语音识别系统动态组合

Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅