Published in: Annual conference of the International Speech Communication Association

Synthesizing Photo-Real Talking Head via Trajectory-Guided Sample Selection

Abstract

In this paper, we propose an HMM trajectory-guided, real image sample concatenation approach to photo-real talking head synthesis. It renders a smooth and natural video of the articulators in sync with given speech signals. An audio-visual database is first used to train a statistical Hidden Markov Model (HMM) of lip movement; the trained model is then used to generate a visual parameter trajectory of lip movement for given speech signals, all in the maximum likelihood sense. The HMM-generated trajectory then serves as a guide for selecting, from the original training database, an optimal sequence of mouth images, which are stitched back into a background head video. The whole procedure is fully automatic and data-driven. With as little as 20 minutes of audio/video footage from a speaker, the proposed system can synthesize a highly photo-real video in sync with the given speech signals. This system won first place in the Audio-Visual match contest of the LIPS2009 Challenge, which was perceptually evaluated by recruited human subjects.
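
The abstract does not spell out how the "optimal sequence" of mouth images is chosen. As a rough illustration only, the sketch below assumes a common formulation for such selection problems: a dynamic-programming search that trades off a target cost (distance between a candidate image's features and the guiding trajectory) against a concatenation cost (distance between consecutive picks). The function name `select_samples`, the Euclidean costs, and the `concat_weight` parameter are all hypothetical and are not taken from the paper.

```python
from math import dist  # Euclidean distance, Python 3.8+


def select_samples(trajectory, samples, concat_weight=1.0):
    """Return one sample index per frame, chosen by dynamic programming.

    trajectory: list of target feature vectors, one per video frame
                (standing in for the HMM-generated visual trajectory).
    samples:    list of feature vectors for candidate mouth images.
    Total cost (an assumption, not the paper's definition):
        sum of target distances
        + concat_weight * sum of distances between consecutive picks.
    """
    if not trajectory or not samples:
        return []
    n = len(samples)
    # Pairwise concatenation cost between candidates (hypothetical choice).
    concat = [[concat_weight * dist(a, b) for b in samples] for a in samples]
    # best[j]: lowest cost of any path that ends in sample j at this frame.
    best = [dist(trajectory[0], s) for s in samples]
    back = []  # back[t][j]: best predecessor of sample j at frame t + 1
    for target in trajectory[1:]:
        prev, best, pointers = best, [], []
        for j, s in enumerate(samples):
            i = min(range(n), key=lambda k: prev[k] + concat[k][j])
            pointers.append(i)
            best.append(prev[i] + concat[i][j] + dist(target, s))
        back.append(pointers)
    # Trace the cheapest path back from the last frame to the first.
    j = min(range(n), key=best.__getitem__)
    path = [j]
    for pointers in reversed(back):
        j = pointers[j]
        path.append(j)
    path.reverse()
    return path
```

Under these assumptions, a small `concat_weight` makes the selection follow the trajectory closely, while a large one favours sequences whose consecutive images are similar, at the expense of accuracy. The search costs on the order of frames × samples² operations, so a practical system would probably prune the candidate set per frame first.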
