IEEE Transactions on Signal Processing

Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition



Abstract

The authors demonstrate the effectiveness of phonemic hidden Markov models with Gaussian mixture output densities (mixture HMMs) for speaker-dependent large-vocabulary word recognition. Speech recognition experiments show that for almost any reasonable amount of training data, recognizers using mixture HMMs consistently outperform those employing unimodal Gaussian HMMs. With a sufficiently large training set (e.g. more than 2500 words), use of HMMs with 25-component mixture distributions typically reduces recognition errors by about 40%. It is also found that the mixture HMMs outperform a set of unimodal generalized triphone models having the same number of parameters. Previous attempts to employ mixture HMMs for speech recognition proved discouraging because of the high complexity and computational cost in implementing the Baum-Welch training algorithm. It is shown how mixture HMMs can be implemented very simply in unimodal transition-based frameworks by allowing multiple transitions from one state to another.
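The final observation above — that a Gaussian mixture output density can be realized in a unimodal framework by placing multiple parallel transitions between the same pair of states, each carrying a single Gaussian — can be illustrated numerically. The sketch below is an illustrative reconstruction, not the authors' implementation; the function names and the univariate setting are assumptions made for brevity. It checks that summing over parallel arcs (arc probability × unimodal Gaussian) yields the same likelihood as evaluating the mixture density directly.

```python
import math

def gauss_pdf(x, mu, var):
    # univariate Gaussian density N(x; mu, var)
    return math.exp(-(x - mu) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

def mixture_likelihood(x, weights, mus, variances):
    # mixture HMM view: one state emission, Gaussian-mixture output density
    return sum(w * gauss_pdf(x, m, v)
               for w, m, v in zip(weights, mus, variances))

def parallel_arc_likelihood(x, arcs):
    # unimodal view: several parallel transitions between the same two
    # states, each with an arc probability and a single Gaussian
    return sum(p * gauss_pdf(x, m, v) for p, m, v in arcs)

# hypothetical 3-component mixture (weights sum to 1)
weights   = [0.5, 0.3, 0.2]
mus       = [0.0, 1.0, 2.0]
variances = [1.0, 0.5, 0.25]
arcs = list(zip(weights, mus, variances))

x = 0.7
direct   = mixture_likelihood(x, weights, mus, variances)
via_arcs = parallel_arc_likelihood(x, arcs)
# the two formulations give identical likelihoods
```

Because the two computations are term-by-term identical, the standard unimodal Baum-Welch recursions applied to the parallel-arc topology train the mixture weights (as arc probabilities) and the component Gaussians without any mixture-specific machinery, which is the simplification the abstract describes.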


