Viseme definitions comparison for visual-only speech recognition

Abstract

Audio-visual speech recognition (AVSR) involves recognising what a speaker is uttering using both audio and visual cues. While phonemes, the units of speech in the audio domain, are well documented, the same is not true of the speech units in the visual domain: visemes. In the literature, only a generic viseme definition is recognised. There is no agreement on what visemes imply in practice, or on whether they relate to mouth position or to mouth movement. In this paper a visual-only speech recognition system is presented, trained using either PCA or optical flow visual features. The recognition rate changes depending on which practical viseme definition is used. Four viseme definitions were tested, and the results are analysed to establish which of the four candidates performs best.
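The dependence of the recognition rate on the viseme definition can be illustrated with a toy scoring sketch. Everything here is an assumption made for illustration: the two phoneme-to-viseme mappings (`COARSE`, `FINE`), the class names, and the label sequences are hypothetical and are not the four definitions or the data used in the paper. The sketch only shows that the same predictions can score differently once phonemes are grouped into visemes in different ways.

```python
# Toy illustration only: both mappings are hypothetical and are NOT the
# viseme definitions compared in the paper.

# Coarse definition: /o/ and /u/ share one "rounded" viseme.
COARSE = {
    "p": "bilabial", "b": "bilabial", "m": "bilabial",
    "f": "labiodental", "v": "labiodental",
    "a": "open", "o": "rounded", "u": "rounded",
}

# Finer definition: /o/ and /u/ are split into two visemes.
FINE = {
    "p": "bilabial", "b": "bilabial", "m": "bilabial",
    "f": "labiodental", "v": "labiodental",
    "a": "open", "o": "rounded_mid", "u": "rounded_close",
}


def viseme_accuracy(reference, predicted, mapping):
    """Return the fraction of positions whose reference and predicted
    phonemes fall into the same viseme class under `mapping`."""
    if len(reference) != len(predicted):
        raise ValueError("sequences must have the same length")
    if not reference:
        raise ValueError("sequences must not be empty")
    hits = sum(mapping[r] == mapping[p] for r, p in zip(reference, predicted))
    return hits / len(reference)


# Hypothetical reference labels and recogniser output (phoneme level).
reference = ["b", "a", "m", "u", "f", "o"]
predicted = ["p", "a", "m", "o", "v", "o"]

coarse_rate = viseme_accuracy(reference, predicted, COARSE)
fine_rate = viseme_accuracy(reference, predicted, FINE)

print(f"coarse definition: {coarse_rate:.3f}")
print(f"fine definition:   {fine_rate:.3f}")
```

Under the coarse mapping the confusion between /u/ and /o/ is not counted as an error, whereas under the finer mapping it is, so the score drops. A coarser definition is easier to score well on but carries less information about what was said, which is one reason a direct comparison of definitions is useful.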

Bibliographic details

Source
European Signal Processing Conference | 2011 | pp. 2109-2113 | 5 pages
Authors
Luca Cappelletta; Naomi Harte
Original format: PDF

