首页> 外文会议>International Conference on Spoken Language Processing; 20041004-08; Jeju(KR) >Maximum Entropy Direct Model as a Unified Model for Acoustic Modeling in Speech Recognition
【24h】

Maximum Entropy Direct Model as a Unified Model for Acoustic Modeling in Speech Recognition

机译:最大熵直接模型作为语音识别声学模型的统一模型

获取原文
获取原文并翻译 | 示例

摘要

Traditional statistical models for speech recognition have been dominated by generative models such as Hidden Markov Models (HMMs). We recently proposed a new framework for speech recognition using maximum entropy direct modeling, where the probability of a state or word sequence given an observation sequence is computed directly from the model. In contrast to HMMs, features can be non-independent, asynchronous, and overlapping. In this paper, we discuss how to make the computationally intensive training of such models feasible through parallelizing the IIS (Improved Iterative Scaling) algorithm. The direct model significantly outperforms traditional HMMs in word error rate when used as stand-alone acoustic models. Modest improvements over the best HMM system are seen when combined with HMM and language model scores. The maximum entropy model can potentially incorporate non-independent features such as acoustic phonetic features in a way that is robust to missing features due to mismatch between training and testing.
机译:传统的语音识别统计模型已被诸如隐马尔可夫模型(HMM)之类的生成模型所支配。我们最近提出了一种使用最大熵直接建模的语音识别新框架,其中直接从模型中计算出给定观察序列的状态或单词序列的概率。与HMM相比,功能可以是非独立的,异步的和重叠的。在本文中,我们讨论了如何通过并行化IIS(改进的迭代缩放)算法来使此类模型的计算密集型训练可行。当用作独立声学模型时,直接模型的字误码率明显优于传统HMM。与HMM和语言模型得分结合使用时,可以看到对最佳HMM系统的适度改进。最大熵模型可以潜在地合并非独立特征(例如声学语音特征),这种方式对于由于训练和测试之间的不匹配而导致的缺失特征具有鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号