Auditory-Visual Speech Recognition Error Detection using Longest Common Subsequence Matching of Vowel Sequences

Abstract

Speech recognition based only on auditory information tends to be very susceptible to external noise. In this study, we investigated how visual information can improve speech recognition. Part of what a speaker says can be guessed from the shape of his or her mouth; in particular, information about the vowels in the speech can be sufficiently obtained from the lip shape. The order of the vowel sequence, found by analyzing changes in the lip shape, can serve as an indicator of confidence in a recognition result obtained from an unclear voice. Using the longest common subsequence algorithm, comparing the vowel sequence produced by voice-based speech recognition with the one produced by vision-based lip reading helps to improve confidence in the speech recognition. Studies that reduce the error rate of speech recognition allow the technique to be applied more widely in noisy environments.
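
The comparison described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the vowel labels, the normalization by the longer sequence, and the 0.8 threshold are all hypothetical choices made for the example. Only the use of the longest common subsequence (LCS) to compare the two vowel sequences comes from the abstract.

```python
from typing import Sequence


def lcs_length(a: Sequence[str], b: Sequence[str]) -> int:
    """Length of the longest common subsequence of a and b (row-by-row DP)."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                # Extend the best subsequence that ends before both symbols.
                curr.append(prev[j - 1] + 1)
            else:
                # Skip one symbol from either sequence, keep the better result.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def vowel_agreement(audio_vowels: Sequence[str],
                    visual_vowels: Sequence[str]) -> float:
    """LCS length divided by the longer sequence length (assumed score)."""
    longest = max(len(audio_vowels), len(visual_vowels))
    if longest == 0:
        return 1.0  # two empty sequences trivially agree
    return lcs_length(audio_vowels, visual_vowels) / longest


def is_suspect(audio_vowels: Sequence[str],
               visual_vowels: Sequence[str],
               threshold: float = 0.8) -> bool:
    """Flag a possible recognition error; the threshold is a hypothetical value."""
    return vowel_agreement(audio_vowels, visual_vowels) < threshold
```

For example, if the audio recognizer yields the vowel sequence `a i o a` and the lip reader yields `a i u a`, the LCS is `a i a` (length 3), the agreement score is 3/4, and the result would be flagged as suspect under the assumed threshold of 0.8.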

Bibliographic details

Source
Applied Mathematics and Informatics | 2010 | pp. 45-50 | 6 pages
Conference location: Athens (GR)
Authors
WOOHYUN KO; KWANGHEE LEE; SEUNGJOON LEE; SANGMOO LEE
Author affiliation

Department of Applied Robot Technology, Korea Institute of Industrial Technology, 1271-18, Sa3-dong, Sangrok-gu, Ansan, Republic of Korea (listed for all four authors)
Original format: PDF
Language: English
CLC classification: Applied mathematics
Keywords
speech recognition; lip reading; error detection; longest common subsequence;

Similar literature

1. The effects of auditory-visual vowel identification training on speech recognition under difficult listening conditions. [J]. Richie C, Kewley-Port D. Journal of speech, language, and hearing research: JSLHR. 2008, No. 6
2. Auditory-visual speech recognition by hearing-impaired subjects: Consonant recognition, sentence recognition, and auditory-visual integration. [J]. Ken W. Grant, Brian E. Walden, Philip F. Seitz. The Journal of the Acoustical Society of America. 1998, No. 5
3. Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points. [J]. Anil Kumar Vuppala, K. Sreenivasa Rao, Saswat Chakrabarti. Circuits, systems, and signal processing. 2012, No. 4
4. Human Action Recognition using Recursive Self Organizing Map and Longest Common Subsequence Matching. [C]. Wei Huang, Jonathan Wu. Workshop on Applications of Computer Vision. 2009
5. Embedding-based subsequence matching in large sequence databases. [D]. Papapetrou, Panagiotis. 2010
6. Use of Exclusion by a Chimpanzee (Pan troglodytes) During Speech Perception and Auditory-Visual Matching-to-Sample. [O]. Michael J. Beran -1
7. Automatic pronunciation error detection in non-native speech: the case of vowel errors in Dutch. [O]. Doremalen J.J.H.C. van, Cucchiarini C., Strik H. 2013
