...
首页> 外文期刊>IEEE Transactions on Circuits and Systems for Video Technology >Recognition of visual speech elements using adaptively boosted hidden Markov models
【24h】

Recognition of visual speech elements using adaptively boosted hidden Markov models

机译:使用自适应增强隐马尔可夫模型识别视觉语音元素

获取原文
获取原文并翻译 | 示例
           

摘要

The performance of automatic speech recognition (ASR) system can be significantly enhanced with additional information from visual speech elements such as the movement of lips, tongue, and teeth, especially under noisy environment. In this paper, a novel approach for recognition of visual speech elements is presented. The approach makes use of adaptive boosting (AdaBoost) and hidden Markov models (HMMs) to build an AdaBoost-HMM classifier. The composite HMMs of the AdaBoost-HMM classifier are trained to cover different groups of training samples using the AdaBoost technique and the biased Baum-Welch training method. By combining the decisions of the component classifiers of the composite HMMs according to a novel probability synthesis rule, a more complex decision boundary is formulated than using the single HMM classifier. The method is applied to the recognition of the basic visual speech elements. Experimental results show that the AdaBoost-HMM classifier outperforms the traditional HMM classifier in accuracy, especially for visemes extracted from contexts.
机译:自动语音识别(ASR)系统的性能可以通过视觉语音元素(如嘴唇,舌头和牙齿的运动)中的其他信息(特别是在嘈杂的环境中)获得更多信息。在本文中,提出了一种新颖的视觉语音元素识别方法。该方法利用自适应增强(AdaBoost)和隐马尔可夫模型(HMM)来构建AdaBoost-HMM分类器。使用AdaBoost技术和偏倚的Baum-Welch训练方法对AdaBoost-HMM分类器的复合HMM进行训练,使其覆盖不同的训练样本组。通过根据新颖的概率合成规则组合复合HMM的组件分类器的决策,与使用单个HMM分类器相比,制定了更为复杂的决策边界。该方法适用于基本视觉语音元素的识别。实验结果表明,AdaBoost-HMM分类器在准确性方面优于传统的HMM分类器,尤其是从上下文中提取视位素时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号