首页> 外文期刊>The Visual Computer >Automatic visual speech segmentation and recognition using directional motion history images and Zernike moments
【24h】

Automatic visual speech segmentation and recognition using directional motion history images and Zernike moments

机译:使用定向运动历史图像和Zernike矩自动进行视觉语音分割和识别

获取原文
获取原文并翻译 | 示例

摘要

Appearance-based visual speech recognition using only video signals is presented. The proposed technique is based on the use of directional motion history images (DMHIs), which is an extension of the popular optical-flow method for object tracking. Zernike moments of each DMHI are computed in order to perform the classification. The technique incorporates automatic temporal segmentation of isolated utterances. The segmentation of isolated utterance is achieved using pair-wise pixel comparison. Support vector machine is used for classification and the results are based on leave-one-out paradigm. Experimental results show that the proposed technique achieves better performance in visemes recognition than others reported in literature. The benefit of this proposed visual speech recognition method is that it is suitable for real-time applications due to quick motion tracking system and the fast classification method employed. It has applications in command and control using lip movement to text conversion and can be used in noisy environment and also for assisting speech impaired persons.
机译:提出了仅使用视频信号的基于外观的视觉语音识别。所提出的技术基于定向运动历史图像(DMHI)的使用,它是流行的用于对象跟踪的光流方法的扩展。计算每个DMHI的Zernike矩以便执行分类。该技术结合了孤立话语的自动时间分段。使用逐对像素比较可实现孤立话语的分段。支持向量机用于分类,结果基于留一法范式。实验结果表明,与文献报道的其他技术相比,该技术在语音识别方面具有更好的性能。所提出的视觉语音识别方法的优点在于,由于采用了快速运动跟踪系统和快速分类方法,因此适合实时应用。它在通过嘴唇移动到文本转换的命令和控制中具有应用,可以在嘈杂的环境中使用,也可以帮助语音障碍者。

著录项

  • 来源
    《The Visual Computer》 |2013年第10期|969-982|共14页
  • 作者单位

    School of Electrical and Computer Engineering and Health Innovations Research Institute, RMIT University, Melbourne, Vic 3001, Australia;

    School of Electrical and Computer Engineering and Health Innovations Research Institute, RMIT University, Melbourne, Vic 3001, Australia;

    ISSNIP, Dept of Electrical and Electronic Engineering, The University of Melbourne, Melbourne, Vic 3010, Australia;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Motion analysis; Temporal segmentation; Directional motion history image; Optical flow; Zernike moments;

    机译:运动分析;时间分割;定向运动历史图像;光流泽尼克时刻;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号