VISUAL SPEECH RECOGNITION USING DYNAMIC FEATURES AND SUPPORT VECTOR MACHINES

WAI CHEE YAU; DINESH KANT KUMAR; SRIDHAR POOSAPADI ARJUNAN

首页> 外文期刊>International Journal of Image and Graphics >VISUAL SPEECH RECOGNITION USING DYNAMIC FEATURES AND SUPPORT VECTOR MACHINES

【24h】

VISUAL SPEECH RECOGNITION USING DYNAMIC FEATURES AND SUPPORT VECTOR MACHINES

机译：利用动态特征和支持向量机进行视觉识别

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents a vision based technique to identify the unspoken phones using a small camera that is located on the headset of the speaker. The system is based on temporal integration of the video data to generate motion history image (MHI). The paper proposes the use of global features to classify the MHI and compares the use of image moments with Discrete Cosine Transform (DCT). A comparison between Zernike moments (ZM) with DCT indicates that while the accuracy of classification for both techniques is very comparable (96% for ZM and 94% for DCT) when there is no relative motion between the camera and the mouth, ZM is resilient to rotation of the camera and continues to gives good results despite rotation but DCT is sensitive to rotation. Based on the accuracy of the system and its resilience to movement artefacts such as rotation, the authors propose the use of such a system for human computer interface. Such a system could be invaluable when it is important to communicate without making a sound, such as giving passwords when in an open office or in public spaces.

机译：本文提出了一种基于视觉的技术，可以使用扬声器耳麦上的小型摄像头识别未说出的电话。该系统基于视频数据的时间积分以生成运动历史图像（MHI）。本文提出使用全局特征对MHI进行分类，并比较图像矩和离散余弦变换（DCT）的使用。 Zernike矩（ZM）与DCT的比较表明，尽管在相机和嘴巴之间没有相对运动时，两种技术的分类精度非常可比（ZM为96％，DCT为94％），但是ZM具有弹性相机旋转，尽管旋转，仍然可以提供良好的效果，但DCT对旋转很敏感。基于该系统的准确性及其对运动伪影（例如旋转）的适应性，作者建议将这种系统用于人机界面。当在不发出声音的情况下进行交流很重要时（例如在开放式办公室或公共场所中输入密码时），这种系统可能是无价的。

著录项

来源
《International Journal of Image and Graphics》 |2008年第3期|共19页
作者
WAI CHEE YAU; DINESH KANT KUMAR; SRIDHAR POOSAPADI ARJUNAN;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算机的应用;
关键词
Visual speech recognition; Motion segmentation; Zernike moments; Discrete cosine transform; Support vector machines; Hidden Markov models;

机译：视觉语音识别;运动分割;泽尼克矩;离散余弦变换;支持向量机;隐马尔可夫模型;

相似文献

外文文献
中文文献
专利

1. VISUAL SPEECH RECOGNITION USING DYNAMIC FEATURES AND SUPPORT VECTOR MACHINES [J] . WAI CHEE YAU, DINESH KANT KUMAR, SRIDHAR POOSAPADI ARJUNAN International Journal of Image and Graphics . 2008,第3期

机译：利用动态特征和支持向量机进行视觉识别
2. VISUAL SPEECH RECOGNITION USING OPTICAL FLOW AND SUPPORT VECTOR MACHINES [J] . AYAZ A. SHAIKH, DINESH K. KUMAR, JAYAVARDHANA GUBBI International Journal of Computational Intelligence and Applications . 2011,第2期

机译：使用光学流和支持矢量机的视觉识别
3. VISUAL SPEECH RECOGNITION USING OPTICAL FLOW AND SUPPORT VECTOR MACHINES [J] . AYAZ A. SHAIKH∗ and DINESH K. KUMARJAYAVARDHANA GUBBIS International Journal of Computational Intelligence and Applications . 2011,第2期

机译：使用光学流和支持矢量机的视觉识别
4. Application of support vector machines classifiers to visual speech recognition [C] . Gordan M., Kotropoulos C., Pitas I. Image Processing. 2002. Proceedings. 2002 International Conference on . 2002

机译：支持向量机分类器在视觉语音识别中的应用
5. Support vector machines for speech recognition. [D] . Ganapathiraju, Aravind. 2002

机译：支持向量机用于语音识别。
6. ir-HSP: Improved Recognition of Heat Shock Proteins Their Families and Sub-types Based On g-Spaced Di-peptide Features and Support Vector Machine [O] . Prabina K. Meher, Tanmaya K. Sahu, Shachi Gahoi, 2017

机译：ir-HSP：基于g间隔二肽特征和支持向量机的热休克蛋白其家族和亚型的改进识别
7. A Support Vector Machine-Based Dynamic Network for Visual Speech Recognition Applications [O] . 2002

机译：用于视觉语音识别应用的基于支持向量机的动态网络

VISUAL SPEECH RECOGNITION USING DYNAMIC FEATURES AND SUPPORT VECTOR MACHINES

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅