Recognition of visual speech elements using adaptively boosted hidden Markov models

Say Wei Foo; Yong Lian; Liang Dong


Abstract

The performance of an automatic speech recognition (ASR) system can be significantly enhanced with additional information from visual speech elements such as the movement of the lips, tongue, and teeth, especially in noisy environments. In this paper, a novel approach for the recognition of visual speech elements is presented. The approach makes use of adaptive boosting (AdaBoost) and hidden Markov models (HMMs) to build an AdaBoost-HMM classifier. The composite HMMs of the AdaBoost-HMM classifier are trained to cover different groups of training samples using the AdaBoost technique and the biased Baum-Welch training method. By combining the decisions of the component classifiers of the composite HMMs according to a novel probability synthesis rule, a more complex decision boundary is formulated than is possible with a single HMM classifier. The method is applied to the recognition of the basic visual speech elements. Experimental results show that the AdaBoost-HMM classifier outperforms the traditional HMM classifier in accuracy, especially for visemes extracted from contexts.
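As a rough illustration of the boosting idea the abstract refers to, the sketch below shows one round of the standard discrete AdaBoost sample reweighting and a weighted vote over component decisions. It is an assumption-laden sketch, not the paper's method: the abstract does not give the biased Baum-Welch procedure or the probability synthesis rule, so neither is reproduced here, and the function names `adaboost_round` and `weighted_vote` are hypothetical. In an AdaBoost-HMM setting, each component classifier would presumably be an HMM trained with the current sample weights.

```python
import math


def adaboost_round(weights, correct):
    """One standard AdaBoost round (generic, not the paper's variant).

    weights: current sample weights, summing to 1.
    correct: per-sample booleans, True if the current component
             classifier labelled that sample correctly.
    Returns (alpha, new_weights).
    """
    # Weighted error of the current component classifier.
    err = sum(w for w, ok in zip(weights, correct) if not ok)
    if err <= 0.0 or err >= 0.5:
        raise ValueError("weighted error must lie in (0, 0.5)")
    # Confidence assigned to this component classifier.
    alpha = 0.5 * math.log((1.0 - err) / err)
    # Raise the weights of misclassified samples, lower the others,
    # so the next component focuses on the hard samples.
    raw = [w * math.exp(-alpha if ok else alpha)
           for w, ok in zip(weights, correct)]
    total = sum(raw)
    return alpha, [w / total for w in raw]


def weighted_vote(alphas, votes):
    """Combine component decisions (class labels) by alpha-weighted vote.

    This plain vote is an assumed stand-in for the paper's
    probability synthesis rule, which the abstract does not specify.
    """
    score = {}
    for a, label in zip(alphas, votes):
        score[label] = score.get(label, 0.0) + a
    return max(score, key=score.get)
```

After one round, the misclassified samples together carry half of the total weight, which is what pushes the next component classifier toward a different group of training samples.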


Bibliographic details

Source: IEEE Transactions on Circuits and Systems for Video Technology, 2004, no. 5, pp. 693-705 (13 pages)
Authors: Say Wei Foo; Yong Lian; Liang Dong
Affiliation: Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore
Original format: PDF
Language: English
Chinese Library Classification: Radio electronics; telecommunications technology
Keywords: speech recognition; hidden Markov models; gesture recognition; pattern classification; probability; video signal processing; visual speech element recognition; adaptively boosted hidden Markov models; automatic speech recognition systems; Baum-Welch training method; probability synthesis rule; decision boundary; automatic lip reading; visual speech processing; time series distribution; spread-out distribution; visemes identification; computational complexity


Similar literature

1. Recognition of visual speech elements using adaptively boosted hidden Markov models [J]. Say Wei Foo, Yong Lian, Liang Dong. IEEE Transactions on Circuits and Systems for Video Technology. 2004, no. 5
2. Evolutionary structure of hidden Markov models for audio-visual Arabic speech recognition [J]. Amina Makhlouf, Lilia Lazli, Bachir Bensaker. International Journal of Signal and Imaging Systems Engineering. 2016, no. 1
3. Recognition of Visual Speech Elements Using Hidden Markov Models [C]. Say Wei Foo, Liang Dong. Third IEEE Pacific Rim Conference on Multimedia PCM 2002, Dec 16-18, 2002, Hsinchu, Taiwan. 2002
4. Online Learning of Large Margin Hidden Markov Models for Automatic Speech Recognition [D]. Cheng, Chih-Chieh. 2011
5. Assessment of Dysarthria Using One-Word Speech Recognition with Hidden Markov Models [O]. Seung Hak Lee, Minje Kim, Han Gil Seo. 2019
6. Visual Speech Recognition Using Active Shape Models And Hidden Markov Models [O]. Juergen Luettin, Neil A. Thacker, Steve W. Beet. 1996
7. Improving on hidden Markov models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding [R]. Hogden, J. 1996
