IEEE Transactions on Multimedia
Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling

Abstract

This paper presents an articulatory modelling approach to converting acoustic speech into realistic mouth animation. We directly model the movements of articulators, such as the lips, tongue, and teeth, using a dynamic Bayesian network (DBN)-based audio-visual articulatory model (AVAM). A multiple-stream structure with a shared articulator layer is adopted in the model to synchronously associate the two building blocks of speech, i.e., audio and video. This model not only describes the synchronization between visual articulatory movements and audio speech, but also reflects the linguistic fact that different articulators evolve asynchronously. We also present a Baum-Welch DBN inversion (DBNI) algorithm to generate optimal facial parameters from audio, given the trained AVAM, under the maximum likelihood (ML) criterion. Extensive objective and subjective evaluations on the JEWEL audio-visual dataset demonstrate that, compared with phonemic HMM approaches, the facial parameters estimated by our approach follow the true parameters more accurately, and the synthesized facial animation sequences are so lively that 38% of them are indistinguishable from the real ones.
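
To make the audio-to-visual inversion idea concrete, below is a minimal, hypothetical Python sketch. It is not the paper's method: a plain Gaussian HMM stands in for the DBN-based AVAM, and the Baum-Welch DBNI algorithm is replaced by simple Viterbi decoding followed by per-state maximum-likelihood visual output. All class and variable names (JointAudioVisualHMM, invert, etc.) are illustrative assumptions.

import numpy as np

class JointAudioVisualHMM:
    """Toy joint audio-visual state model (a stand-in for the DBN-based AVAM)."""

    def __init__(self, trans, audio_means, audio_vars, visual_means):
        self.trans = np.asarray(trans)                # (S, S) state transition probs
        self.audio_means = np.asarray(audio_means)    # (S, Da) audio emission means
        self.audio_vars = np.asarray(audio_vars)      # (S, Da) diagonal variances
        self.visual_means = np.asarray(visual_means)  # (S, Dv) visual (facial) means

    def _log_audio_likelihood(self, audio):
        # log N(audio_t | mu_s, diag(var_s)) for every (frame, state) pair
        diff = audio[:, None, :] - self.audio_means[None, :, :]        # (T, S, Da)
        return -0.5 * np.sum(diff ** 2 / self.audio_vars
                             + np.log(2 * np.pi * self.audio_vars), axis=2)

    def invert(self, audio):
        """Viterbi-decode the audio, then emit per-state ML visual parameters."""
        audio = np.asarray(audio)
        T, S = len(audio), len(self.trans)
        log_b = self._log_audio_likelihood(audio)      # (T, S) emission scores
        log_a = np.log(self.trans + 1e-12)
        delta = np.full((T, S), -np.inf)
        psi = np.zeros((T, S), dtype=int)
        delta[0] = log_b[0]                            # uniform initial prior (assumption)
        for t in range(1, T):
            scores = delta[t - 1][:, None] + log_a     # (S_prev, S_next)
            psi[t] = scores.argmax(axis=0)
            delta[t] = scores.max(axis=0) + log_b[t]
        states = np.zeros(T, dtype=int)                # backtrack the best state path
        states[-1] = delta[-1].argmax()
        for t in range(T - 2, -1, -1):
            states[t] = psi[t + 1, states[t + 1]]
        # ML visual trajectory: the visual mean of each decoded state
        return self.visual_means[states]

# Tiny smoke test with random parameters (2 states, 3-dim audio, 4-dim visual)
rng = np.random.default_rng(0)
model = JointAudioVisualHMM(
    trans=[[0.9, 0.1], [0.2, 0.8]],
    audio_means=rng.normal(size=(2, 3)),
    audio_vars=np.ones((2, 3)),
    visual_means=rng.normal(size=(2, 4)),
)
facial = model.invert(rng.normal(size=(50, 3)))        # (50, 4) facial trajectory

Note that this single-chain sketch cannot capture what the paper's shared articulator layer is for: letting the audio and visual streams evolve semi-asynchronously across articulators. It only illustrates the coarsest version of the ML inversion step.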
