
Speaker-independent speech-driven facial animation using a hierarchical model



Abstract

We present a system capable of producing video-realistic videos of a speaker given only audio. The audio input signal requires no phonetic labelling and is speaker-independent. The system requires only a small training set of video to achieve convincing, realistic facial synthesis. It learns the natural mouth and face dynamics of a speaker, allowing new facial poses, unseen in the training video, to be synthesised. To achieve this we have developed a novel approach which utilises a hierarchical and nonlinear PCA model that couples speech and appearance. We show that the model is capable of synthesising videos of a speaker from new audio segments spoken by both previously heard and unheard speakers. The model is highly compact, making it suitable for a wide range of real-time multimedia and telecommunications applications on standard hardware.
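The abstract gives no implementation detail beyond a hierarchical, nonlinear PCA that couples speech and appearance. The sketch below illustrates only the core idea with a single *linear* coupled PCA: fit one basis over concatenated per-frame speech and appearance features, then infer appearance parameters from audio alone. All names, feature dimensions, and the least-squares inference step are assumptions for illustration, not the paper's method.

```python
# Illustrative sketch only: a linear coupled PCA standing in for the
# paper's hierarchical, nonlinear speech/appearance model. All names
# (fit_coupled_pca, audio_to_appearance, feature sizes) are hypothetical.
import numpy as np

def fit_coupled_pca(audio_feats, appearance_params, n_components=20):
    """Fit one PCA over concatenated [audio | appearance] frame vectors.

    audio_feats:       (n_frames, d_audio) per-frame speech features
    appearance_params: (n_frames, d_app)   per-frame face-model parameters
    """
    joint = np.hstack([audio_feats, appearance_params])
    mean = joint.mean(axis=0)
    # Principal axes of the coupled space via SVD of the centred data.
    _, _, vt = np.linalg.svd(joint - mean, full_matrices=False)
    return mean, vt[:n_components]        # basis: (k, d_audio + d_app)

def audio_to_appearance(new_audio, mean, basis, d_audio):
    """Infer appearance parameters from audio alone.

    Solves, in the least-squares sense, for joint-space coefficients that
    best explain the audio block of the basis, then reconstructs the
    appearance block from those coefficients.
    """
    a_mean, p_mean = mean[:d_audio], mean[d_audio:]
    a_basis, p_basis = basis[:, :d_audio], basis[:, d_audio:]
    coeffs, *_ = np.linalg.lstsq(a_basis.T, (new_audio - a_mean).T,
                                 rcond=None)
    return coeffs.T @ p_basis + p_mean    # (n_frames, d_app)

if __name__ == "__main__":
    # Stand-in data: 13 MFCC-like audio features, 30 appearance parameters.
    rng = np.random.default_rng(0)
    audio = rng.normal(size=(500, 13))
    faces = rng.normal(size=(500, 30))
    mean, basis = fit_coupled_pca(audio, faces, n_components=20)
    out = audio_to_appearance(audio[:5], mean, basis, d_audio=13)
    print(out.shape)                      # (5, 30)
```

In a full pipeline, the recovered per-frame appearance parameters would drive a face model to render video; the actual system additionally layers PCA models hierarchically and handles nonlinearity, which this linear sketch deliberately omits.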
