首页> 外文会议> >Visual speech recognition using active shape models and hidden Markov models
【24h】

Visual speech recognition using active shape models and hidden Markov models

机译:使用主动形状模型和隐马尔可夫模型的视觉语音识别

获取原文
获取外文期刊封面目录资料

摘要

This paper describes a novel approach for visual speech recognition. The shape of the mouth is modelled by an active shape model which is derived from the statistics of a training set and used to locate, track and parameterise the speaker's lip movements. The extracted parameters representing the lip shape are modelled as continuous probability distributions and their temporal dependencies are modelled by hidden Markov models. We present recognition tests performed on a database of a broad variety of speakers and illumination conditions. The system achieved an accuracy of 85.42% for a speaker independent recognition task of the first four digits using lip shape information only.
机译:本文介绍了一种视觉语音识别的新方法。嘴的形状是通过主动形状模型来建模的,该模型是从训练集的统计数据中得出的,并用于定位,跟踪和参数化说话人的嘴唇运动。提取的代表嘴唇形状的参数被建模为连续概率分布,并且它们的时间依赖性通过隐藏的马尔可夫模型进行建模。我们介绍了对多种扬声器和照明条件的数据库进行的识别测试。该系统仅使用嘴唇形状信息就前四位数字的说话者独立识别任务实现了85.42%的准确度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号