首页> 外文会议> >Visualize speech: a continuous speech recognition system for facial animation using acoustic visemes
【24h】

Visualize speech: a continuous speech recognition system for facial animation using acoustic visemes

机译:可视化语音:使用声学视位素的面部动画连续语音识别系统

获取原文
获取外文期刊封面目录资料

摘要

This paper presents an acoustic viseme based continuous speech recognition system for speech driven talking face animation. The system is developed using viseme HMMs with acoustic speech as input only. Triseme HMMs are adopted to reflect the mouth shape contexts. Visual decision trees are introduced to get robust parameter training for triseme HMMs with the limited training data. In the tree building process, methods based on lip rounding and similarity of viseme shapes are introduced to design visual questions. The results from objective and subjective evaluations show that the talking face animation based on the speech recognition system provided by this paper outperforms the conventional phoneme based one, and it is possible to obtain visually relevant speech segmentation information from acoustic speech signal only.
机译:本文提出了一种基于声学视位素的连续语音识别系统,用于语音驱动的说话人脸动画。该系统是使用Viseme HMM开发的,仅将语音作为输入。 Triseme HMM被采用来反映嘴形环境。引入了视觉决策树,以利用有限的训练数据为Triseme HMM进行可靠的参数训练。在树的构建过程中,引入了基于嘴唇倒圆和视位形状相似性的方法来设计视觉问题。主客观评价的结果表明,基于本文提供的语音识别系统的会说话的脸动画优于传统的基于音素的语音,并且仅从声学语音信号中获得视觉上相关的语音分割信息是可能的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号