首页> 外文期刊>IEEE transactions on multimedia >Speech-driven facial animation with realistic dynamics
【24h】

Speech-driven facial animation with realistic dynamics

机译:具有逼真的动态的语音驱动面部动画

获取原文
获取原文并翻译 | 示例
           

摘要

This work presents an integral system capable of generating animations with realistic dynamics, including the individualized nuances, of three-dimensional (3-D) human faces driven by speech acoustics. The system is capable of capturing short phenomena in the orofacial dynamics of a given speaker by tracking the 3-D location of various MPEG-4 facial points through stereovision. A perceptual transformation of the speech spectral envelope and prosodic cues are combined into an acoustic feature vector to predict 3-D orofacial dynamics by means of a nearest-neighbor algorithm. The Karhunen-Loe/spl acute/ve transformation is used to identify the principal components of orofacial motion, decoupling perceptually natural components from experimental noise. We also present a highly optimized MPEG-4 compliant player capable of generating audio-synchronized animations at 60 frames/s. The player is based on a pseudo-muscle model augmented with a nonpenetrable ellipsoidal structure to approximate the skull and the jaw. This structure adds a sense of volume that provides more realistic dynamics than existing simplified pseudo-muscle-based approaches, yet it is simple enough to work at the desired frame rate. Experimental results on an audiovisual database of compact TIMIT sentences are presented to illustrate the performance of the complete system.
机译:这项工作提出了一个整体系统,该系统能够生成具有逼真的动态效果的动画,包括由语音声学驱动的三维(3-D)人脸的个性化细微差别。该系统能够通过立体视觉跟踪各种MPEG-4面部点的3-D位置,从而捕获给定说话者口腔动态中的短现象。语音频谱包络和韵律提示的感知转换被组合到声学特征向量中,以通过最近邻算法预测3-D口腔动力学。 Karhunen-Loe / spl急性/ ve变换用于识别口腔运动的主要成分,从而将感知上的自然成分与实验噪声分离。我们还提供了高度优化的MPEG-4兼容播放器,能够以60帧/秒的速度生成音频同步动画。玩家基于伪肌肉模型,该模型具有不可穿透的椭球结构,以近似头骨和颌骨。与现有的简化的基于伪肌肉的方法相比,此结构增加了体积感,可提供更逼真的动态效果,但它足够简单,可以以所需的帧速率工作。给出了紧凑的TIMIT句子的视听数据库的实验结果,以说明整个系统的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号