首页> 外文会议>IEE Colloquium on Applied Statistical Process Control, 1990 >HMM-based audio-visual speech recognition integrating geometric and appearance-based visual features
【24h】

HMM-based audio-visual speech recognition integrating geometric and appearance-based visual features

机译:基于HMM的视听语音识别集成了几何和基于外观的视觉特征

获取原文

摘要

A good front end for visual feature extraction is an importantelement of audio-visual speech recognition systems. We propose a newvisual feature representation that combines both geometric- andpixel-based features. Using our previously developed contour-basedlip-tracking algorithm, geometric features including the height andwidth of the lips are automatically extracted. Lip boundary trackingallows accurate determination of a region of interest from which weconstruct pixel-based features that are robust to variation in scale andtranslation. Motivated by computational considerations, we selected asubset of the pixels in the center of the inner mouth area that wasfound to capture sufficient details of the appearance of the teeth andtongue for assisting in the discrimination of spoken words. We show theadvantage of the combination of these visual features for visual-onlyand audio-visual speech recognition of isolated digits
机译:视觉特征提取的良好前端非常重要 视听语音识别系统的元素。我们提议一个新的 结合了几何和视觉的视觉特征表示 基于像素的功能。使用我们先前开发的基于轮廓的 唇形跟踪算法,包括高度和高度在内的几何特征 嘴唇的宽度会自动提取。嘴唇边界追踪 可以准确确定我们感兴趣的感兴趣区域 构造基于像素的特征,这些特征对于缩放比例和 翻译。基于计算的考虑,我们选择了一个 内口区域中心的像素子集 发现可以捕捉到足够的牙齿外观细节,并且 用于帮助区分口语的舌头。我们展示 这些视觉功能相结合的优势,仅适用于视觉 数字的视听语音识别

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号