Maximum Entropy Direct Model as a Unified Model for Acoustic Modeling in Speech Recognition

机译：最大熵直接模型作为语音识别声学模型的统一模型

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Traditional statistical models for speech recognition have been dominated by generative models such as Hidden Markov Models (HMMs). We recently proposed a new framework for speech recognition using maximum entropy direct modeling, where the probability of a state or word sequence given an observation sequence is computed directly from the model. In contrast to HMMs, features can be non-independent, asynchronous, and overlapping. In this paper, we discuss how to make the computationally intensive training of such models feasible through parallelizing the IIS (Improved Iterative Scaling) algorithm. The direct model significantly outperforms traditional HMMs in word error rate when used as stand-alone acoustic models. Modest improvements over the best HMM system are seen when combined with HMM and language model scores. The maximum entropy model can potentially incorporate non-independent features such as acoustic phonetic features in a way that is robust to missing features due to mismatch between training and testing.

机译：传统的语音识别统计模型已被诸如隐马尔可夫模型（HMM）之类的生成模型所支配。我们最近提出了一种使用最大熵直接建模的语音识别新框架，其中直接从模型中计算出给定观察序列的状态或单词序列的概率。与HMM相比，功能可以是非独立的，异步的和重叠的。在本文中，我们讨论了如何通过并行化IIS（改进的迭代缩放）算法来使此类模型的计算密集型训练可行。当用作独立声学模型时，直接模型的字误码率明显优于传统HMM。与HMM和语言模型得分结合使用时，可以看到对最佳HMM系统的适度改进。最大熵模型可以潜在地合并非独立特征（例如声学语音特征），这种方式对于由于训练和测试之间的不匹配而导致的缺失特征具有鲁棒性。

著录项

来源
《International Conference on Spoken Language Processing; 20041004-08; Jeju(KR)》|2004年|P.681-684|共4页
会议地点 Jeju(KR)
作者
Hong-Kwang Jeff Kuo; Yuqing Gao;
展开▼
作者单位

IBM T. J. Watson Research Center, 1101 Kitchawan Road, Route 134, Yorktown Heights, NY 10598;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类应用语言学;
关键词

相似文献

外文文献
中文文献
专利

1. Maximum entropy direct models for speech recognition [J] . Hong-Kwang Jeff Kuo, Yuqing Gao IEEE transactions on audio, speech and language processing . 2006,第3期

机译：语音识别的最大熵直接模型
2. Development of a Mandarin-English Bilingual Speech Recognition System with Unified Acoustic Models [J] . Qing-Qing Zhang, Jie-Lin Pan, Yong-Hong Yan Journal of information science and engineering . 2010,第4期

机译：统一声学模型的中英文双语语音识别系统的开发
3. Speech Recognition Based on Unified Model of Acoustic and Language Aspects of Speech [J] . Yotaro Kubo, Atsunori Ogawa, Takaaki Hori, NTT Technical Review . 2013,第12期

机译：基于语音的语言和语言方面统一模型的语音识别
4. Maximum Entropy Direct Model as a Unified Model for Acoustic Modeling in Speech Recognition [C] . Hong-Kwang Jeff Kuo, Yuqing Gao, International Speech Communication Association International Conference on Spoken Language Processing . 2004

机译：最大熵直接模型作为语音识别中声学建模的统一模型
5. Hidden Markov models, maximum mutual information estimation, and the speech recognition problem [D] . Normandin, Yves. 1991

机译：隐藏的马尔可夫模型，最大互信息估计和语音识别问题
6. Retrospective Analysis of Clinical Performance of an Estonian Speech Recognition System for Radiology: Effects of Different Acoustic and Language Models [O] . A. Paats, T. Alumäe, E. Meister, 2018

机译：一项爱沙尼亚放射线语音识别系统临床表现的回顾性分析：不同声学和语言模型的影响
7. Context-dependent acoustic modeling based on hidden maximum entropy model for statistical parametric speech synthesis [O] . Soheil Khorram, Hossein Sameti, Fahimeh Bahmaninezhad, 2014

机译：基于隐藏最大熵模型的上下文相关声学建模，用于统计参数语音合成
8. Improving on hidden Markov models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding [R] . Hogden, J. 1996

机译：改进隐马尔可夫模型：语音识别和语音编码的语义约束，最大似然方法

Maximum Entropy Direct Model as a Unified Model for Acoustic Modeling in Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅