首页> 外文期刊>International Journal of Image and Graphics >VISUAL SPEECH RECOGNITION USING DYNAMIC FEATURES AND SUPPORT VECTOR MACHINES
【24h】

VISUAL SPEECH RECOGNITION USING DYNAMIC FEATURES AND SUPPORT VECTOR MACHINES

机译:利用动态特征和支持向量机进行视觉识别

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper presents a vision based technique to identify the unspoken phones using a small camera that is located on the headset of the speaker. The system is based on temporal integration of the video data to generate motion history image (MHI). The paper proposes the use of global features to classify the MHI and compares the use of image moments with Discrete Cosine Transform (DCT). A comparison between Zernike moments (ZM) with DCT indicates that while the accuracy of classification for both techniques is very comparable (96% for ZM and 94% for DCT) when there is no relative motion between the camera and the mouth, ZM is resilient to rotation of the camera and continues to gives good results despite rotation but DCT is sensitive to rotation. Based on the accuracy of the system and its resilience to movement artefacts such as rotation, the authors propose the use of such a system for human computer interface. Such a system could be invaluable when it is important to communicate without making a sound, such as giving passwords when in an open office or in public spaces.
机译:本文提出了一种基于视觉的技术,可以使用扬声器耳麦上的小型摄像头识别未说出的电话。该系统基于视频数据的时间积分以生成运动历史图像(MHI)。本文提出使用全局特征对MHI进行分类,并比较图像矩和离散余弦变换(DCT)的使用。 Zernike矩(ZM)与DCT的比较表明,尽管在相机和嘴巴之间没有相对运动时,两种技术的分类精度非常可比(ZM为96%,DCT为94%),但是ZM具有弹性相机旋转,尽管旋转,仍然可以提供良好的效果,但DCT对旋转很敏感。基于该系统的准确性及其对运动伪影(例如旋转)的适应性,作者建议将这种系统用于人机界面。当在不发出声音的情况下进行交流很重要时(例如在开放式办公室或公共场所中输入密码时),这种系统可能是无价的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号