首页> 外文会议>International Conference on Computing, Communication and Applications >Lip feature extraction for visual speech recognition using Hidden Markov Model
【24h】

Lip feature extraction for visual speech recognition using Hidden Markov Model

机译:利用隐马尔可夫模型进行视觉语音识别的唇部特征提取

获取原文

摘要

Visual speech recognition refers to recognizing the spoken words based on visual information of lip movements. In this paper, a new approach for lip reading is presented. Visual speech recognition is applied in science areas, such as speech recognition system and also in social activities, such as recognizing the spoken words of hearing impaired persons. The visual speech video is given as input to the face localization module for detecting the face region. The mouth region is determined relative to the face region. Different methods were used for feature extraction. Out of the different feature extraction methods, the 16 point DCT method gives the experimental results of 93.5% of performance accuracy. Then, these feature vectors are applied separately as inputs to the Hidden Markov Model (HMM) for recognizing the visual speech. 10 participants were uttered 35 different isolated words. For each word, 20 samples are collected for training and testing the HMM.
机译:视觉语音识别是指基于唇部运动的视觉信息来识别口语词。 在本文中,提出了一种新方法。 视觉语音识别适用于科学领域,例如语音识别系统,也在社交活动中,例如识别听力受损人口的口语。 视觉语音视频被给出为用于检测面部区域的面部定位模块的输入。 相对于面部区域确定嘴区域。 使用不同的方法进行特征提取。 出不同特征提取方法,16点DCT方法为实验结果提供了93.5%的性能准确性。 然后,这些特征向量被单独应用为对隐藏的马尔可夫模型(HMM)的输入,用于识别视觉语音。 10名参与者被列出了35个不同的孤立词语。 对于每个单词,收集20个样本以进行培训和测试嗯。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号