首页> 外文会议> >Visualize speech: a continuous speech recognition system for facial animation using acoustic visemes

【24h】

Visualize speech: a continuous speech recognition system for facial animation using acoustic visemes

机译：可视化语音：使用声学视位素的面部动画连续语音识别系统

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents an acoustic viseme based continuous speech recognition system for speech driven talking face animation. The system is developed using viseme HMMs with acoustic speech as input only. Triseme HMMs are adopted to reflect the mouth shape contexts. Visual decision trees are introduced to get robust parameter training for triseme HMMs with the limited training data. In the tree building process, methods based on lip rounding and similarity of viseme shapes are introduced to design visual questions. The results from objective and subjective evaluations show that the talking face animation based on the speech recognition system provided by this paper outperforms the conventional phoneme based one, and it is possible to obtain visually relevant speech segmentation information from acoustic speech signal only.

机译：本文提出了一种基于声学视位素的连续语音识别系统，用于语音驱动的说话人脸动画。该系统是使用Viseme HMM开发的，仅将语音作为输入。 Triseme HMM被采用来反映嘴形环境。引入了视觉决策树，以利用有限的训练数据为Triseme HMM进行可靠的参数训练。在树的构建过程中，引入了基于嘴唇倒圆和视位形状相似性的方法来设计视觉问题。主客观评价的结果表明，基于本文提供的语音识别系统的会说话的脸动画优于传统的基于音素的语音，并且仅从声学语音信号中获得视觉上相关的语音分割信息是可能的。

著录项

来源
《》|2003年|p.872-875|共4页
会议地点
作者
Xie Lei; Jiang Dongmei; Ravyse; I.; Zhao Rongchun; Verhelst; W.; Sahli; H.; Conlenis; J.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词
speech recognition; hidden Markov models; computer animation; acoustic viseme based continuous speech recognition system; speech driven talking face animation; triseme HMM; hidden Markov models; lip rounding; training data; speech segmentation; objective evaluations; subjective evaluations;

机译：语音识别;隐马尔可夫模型;计算机动画;基于音素的连续语音识别系统;语音驱动的说话人脸动画; triseme HMM;隐马尔可夫模型;嘴唇舍入;训练数据;语音分割;客观评价;主观评价;

相似文献

外文文献
中文文献
专利

1. Hindi phoneme-viseme recognition from continuous speech [J] . A. N. Mishra, Mahesh Chandra, Astik Biswas, International Journal of Signal and Imaging Systems Engineering . 2013,第3期

机译：连续语音对印地语音素的识别
2. About Neural-Network Algorithms Application in Viseme Classification Problem with Face Video in Audiovisual Speech Recognition Systems [J] . A. V. Savchenko, Ya. I. Khokhlova Optical memory & neural networks . 2014,第1期

机译：关于神经网络算法在视听语音识别系统中带有面部视频的Viseme分类问题中的应用
3. Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech [J] . Krerksak Likitsupin, Proadpran Punyabukkana, Chai Wutiwiwatchai, Engineering journal . 2016,第2期

机译：改进大词汇量连续语音基于片段的语音识别的声学方法
4. VISUALIZE SPEECH: A CONTINUOUS SPEECH RECOGNITION SYSTEM FOR FACIAL ANIMATION USING ACOUSTIC VISEMES [C] . Xie Lei, Jiang Dongmei, Use Ravyse, International Conference on Neural Networks and Signal Processing . 2003

机译：可视化语音：使用声学探测的面部动画连续语音识别系统
5. Large-vocabulary speaker-independent continuous speech recognition: The SPHINX system. [D] . Lee, Kai-Fu. 1988

机译：独立于大词汇的说话者的连续语音识别：SPHINX系统。
6. Retrospective Analysis of Clinical Performance of an Estonian Speech Recognition System for Radiology: Effects of Different Acoustic and Language Models [O] . A. Paats, T. Alumäe, E. Meister, 2018

机译：一项爱沙尼亚放射线语音识别系统临床表现的回顾性分析：不同声学和语言模型的影响
7. Beyond visemes: Using disemes in synthetic speech with facial animation [O] . Caroline Henton 1994

机译：超越探测：使用综合演讲与面部动画中的孤独
8. Simulation and Evaluation of Phonetic Speech Recognition Techniques. Volume III. Acoustical Characteristics of Speech Sounds Systematically Arranged in Form of Tables [R] . Otten, K. W. 1964

机译：语音识别技术的仿真与评估。第三卷。以表格形式系统地排列的语音的声学特征

Visualize speech: a continuous speech recognition system for facial animation using acoustic visemes

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅