Conference: National Conference on Biomedical Engineering

Lip-Reading via Deep Neural Network Using Appearance-Based Visual Features

Abstract

Lip-reading is the visual interpretation of lip movements in order to understand speech when the normal sound is not accessible. Image processing techniques for lip-reading recognition have been widely applied in various applications; one example is a computer-based video system developed to provide lip-reading instruction to hearing-impaired adults and teenagers. Automating the process faces challenges such as the coarticulation phenomenon, the homophone effect, insufficient training data per class, the choice of features, and speaker dependency, so a method that overcomes these challenges is desirable. This paper describes a lip-reading model, highlighting its feature extraction and recognition parts. For feature extraction, a particular arrangement of blocks is considered in order to obtain optimal appearance-based features, while a properly structured Deep Belief Network (DBN) is used for recognition. The challenging CUAVE dataset is used in this study, and visual phone recognition (VPR) accuracies are reported at the phone level. The proposed lip-reading recognizer is unique in that it is used for all speakers. The suggested method outperforms the conventional Hidden Markov Model (HMM)-based recognizer, and the best DBN achieves the best VPR accuracy of 45.63%.
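The abstract does not say how the blocks are arranged or which appearance descriptors are computed from them. Purely as an illustration of what a block-based appearance feature can look like, the following minimal sketch splits a grayscale mouth region into a grid and concatenates per-block mean and standard deviation. The function name `block_features`, the 2 x 2 grid, and the choice of statistics are all assumptions made for this example, not the paper's method.

```python
from statistics import mean, pstdev


def block_features(roi, rows, cols):
    """Return [mean, std] for each block of a grayscale ROI, in row-major order.

    `roi` is a list of equal-length rows of pixel intensities. The grid size and
    the two statistics are illustrative assumptions, not taken from the paper.
    """
    height, width = len(roi), len(roi[0])
    if height % rows or width % cols:
        raise ValueError("ROI size must be divisible by the grid size")
    block_h, block_w = height // rows, width // cols

    features = []
    for r in range(rows):
        for c in range(cols):
            # Collect every pixel that falls inside block (r, c).
            pixels = [
                roi[y][x]
                for y in range(r * block_h, (r + 1) * block_h)
                for x in range(c * block_w, (c + 1) * block_w)
            ]
            features.extend([mean(pixels), pstdev(pixels)])
    return features


# Hypothetical 4 x 4 "mouth region" used only to exercise the function.
toy_roi = [
    [0, 0, 10, 10],
    [0, 0, 10, 10],
    [5, 5, 20, 40],
    [5, 5, 20, 40],
]
toy_features = block_features(toy_roi, rows=2, cols=2)
```

In a pipeline of the kind the abstract outlines, a vector like `toy_features` would be computed per video frame and passed to the recognizer; the DBN itself is not sketched here because its structure is not given in the abstract.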