首页> 外文会议>IEE Colloquium on Applied Statistical Process Control, 1990 >HMM-based audio-visual speech recognition integrating geometric and appearance-based visual features

【24h】

HMM-based audio-visual speech recognition integrating geometric and appearance-based visual features

机译：基于HMM的视听语音识别集成了几何和基于外观的视觉特征

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A good front end for visual feature extraction is an importantelement of audio-visual speech recognition systems. We propose a newvisual feature representation that combines both geometric- andpixel-based features. Using our previously developed contour-basedlip-tracking algorithm, geometric features including the height andwidth of the lips are automatically extracted. Lip boundary trackingallows accurate determination of a region of interest from which weconstruct pixel-based features that are robust to variation in scale andtranslation. Motivated by computational considerations, we selected asubset of the pixels in the center of the inner mouth area that wasfound to capture sufficient details of the appearance of the teeth andtongue for assisting in the discrimination of spoken words. We show theadvantage of the combination of these visual features for visual-onlyand audio-visual speech recognition of isolated digits

机译：视觉特征提取的良好前端非常重要视听语音识别系统的元素。我们提议一个新的结合了几何和视觉的视觉特征表示基于像素的功能。使用我们先前开发的基于轮廓的唇形跟踪算法，包括高度和高度在内的几何特征嘴唇的宽度会自动提取。嘴唇边界追踪可以准确确定我们感兴趣的感兴趣区域构造基于像素的特征，这些特征对于缩放比例和翻译。基于计算的考虑，我们选择了一个内口区域中心的像素子集发现可以捕捉到足够的牙齿外观细节，并且用于帮助区分口语的舌头。我们展示这些视觉功能相结合的优势，仅适用于视觉数字的视听语音识别

著录项

来源
《IEE Colloquium on Applied Statistical Process Control, 1990 》|1990年|p.9-14|共6页
会议地点
作者
Chan M.T.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术 ;
关键词

相似文献

外文文献
中文文献
专利

1. Effects of aging on audio-visual speech integration Effects of aging on audio-visual speech integration [J] . Huyse Aurelie, Leybaert Jacqueline, Berthommier Frederic The Journal of the Acoustical Society of America . 2014 ,第4aPta1期

机译：衰老对视听语音整合的影响衰老对视听语音整合的影响
2. Audio-Visual Speech Recognition Using MPEG-4 Compliant Visual Features [J] . Petar S. Aleksic, Jay J. Williams, Zhilin Wu, EURASIP journal on advances in signal processing . 2002 ,第11期

机译：使用符合MPEG-4的视觉功能进行视听语音识别
3. Audio-visual speech recognition integrating 3D lip information obtained from the Kinect [J] . Wang Jianrong, Zhang Ju, Honda Kiyoshi, Multimedia Systems . 2016 ,第3期

机译：整合从Kinect获得的3D嘴唇信息的视听语音识别
4. HMM-based audio-visual speech recognition integrating geometric- and appearance-based visual features [C] . Chan, M.T. . 2001

机译：基于HMM的视听语音识别，融合了基于几何和外观的视觉特征
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition. [D] . Zhang, Xianxian. 2005

机译：基于麦克风阵列，视听和帧选择的强大语音处理功能，可实现车载语音识别和内置说话人识别。
6. Cue Integration in Categorical Tasks: Insights from Audio-Visual Speech Perception [O] . Vikranth Rao Bejjanki, Meghan Clayards, David C. Knill, 2011

机译：类别任务中的提示集成：视听语音感知的见解
7. Hmm-Based Audio-Visual Speech Recognition Integrating Geometric- And Appearance-Based Visual Features [O] . Michael Chan Rockwell, Michael T Chan 2001

机译：基于Hmm的视听语音识别集成了基于几何和外观的视觉特征

HMM-based audio-visual speech recognition integrating geometric and appearance-based visual features

摘要

著录项

相似文献

相关主题

期刊订阅