首页> 外文会议>Applied Mathematics and Informatics >Auditory-Visual Speech Recognition Error Detection using Longest Common Subsequence Matching of Vowel Sequences
【24h】

Auditory-Visual Speech Recognition Error Detection using Longest Common Subsequence Matching of Vowel Sequences

机译:使用元音序列的最长公共子序列匹配的听觉-视觉语音识别错误检测

获取原文
获取原文并翻译 | 示例

摘要

Speech recognition based on only auditory information tends to be very susceptible to external noise. In this study, we investigated how visual information can be useful to improve speech recognition. We can guess a part of talking by an opponent from a shape of his/her mouth. Especially, information about vowels in opponent's talk is able to be sufficiently obtained from the lip shape. The information about an order of sequence of vowels which is found by analyzing the change of the lip shape can be used as an indicator to evaluate confidence for recognition based on an unclear voice. Using the longest common subsequence algorithm, comparison of the order of sequences of vowels resulted from the speech recognition based on voice and one from the lip reading based on version is helpful to improve confidence for the speech recognition. Studies to decrease error ratio of speech recognition allow magnifying application of a technique of speech recognition in a noisy environment.
机译:仅基于听觉信息的语音识别往往非常容易受到外部噪声的影响。在这项研究中,我们调查了视觉信息如何有助于改善语音识别。我们可以从他/她的嘴巴形状中猜测对手讲话的一部分。特别地,能够从嘴唇形状中充分获得关于对方讲话中的元音的信息。通过分析唇形的变化而找到的有关元音顺序的信息可以用作评估基于不清晰语音的识别信心的指标。使用最长的公共子序列算法,比较基于语音的语音识别和基于版本的唇读的元音序列顺序,有助于提高对语音识别的信心。减少语音识别的错误率的研究允许在嘈杂的环境中放大应用语音识别技术。

著录项

  • 来源
  • 会议地点 Athens(GR)
  • 作者单位

    Department of Applied Robot Technology Korea Institute of Industrial Technology 1271-18, Sa3-dong, Sangrok-gu, Ansan KOREA, REPUBLIC OF;

    Department of Applied Robot Technology Korea Institute of Industrial Technology 1271-18, Sa3-dong, Sangrok-gu, Ansan KOREA, REPUBLIC OF;

    Department of Applied Robot Technology Korea Institute of Industrial Technology 1271-18, Sa3-dong, Sangrok-gu, Ansan KOREA, REPUBLIC OF;

    Department of Applied Robot Technology Korea Institute of Industrial Technology 1271-18, Sa3-dong, Sangrok-gu, Ansan KOREA, REPUBLIC OF;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 应用数学;
  • 关键词

    speech recognition; lip reading; error detection; longest common subsequence;

    机译:语音识别;唇读错误检测;最长共同子序列;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号