首页> 外文期刊>Journal of VLSI signal processing >Hidden Markov Model Inversion for Audio-to-Visual Conversion in an MPEG-4 Facial Animation System
【24h】

Hidden Markov Model Inversion for Audio-to-Visual Conversion in an MPEG-4 Facial Animation System

机译:MPEG-4面部动画系统中用于视听转换的隐马尔可夫模型反演

获取原文
获取原文并翻译 | 示例
       

摘要

MPEG-4 standard allows composition of natural or synthetic video with facial animation. Based on this standard, an animated face can be inserted into natural or synthetic video to create new virtual working environments such as virtual meetings or virtual collaborative environments. For these applications, audio-to-visual conversion techniques can be used to generate a talking face that is synchronized with the voice. In this paper, we address audio-to-visual conversion problems by introducing a novel Hidden Markov Model Inversion (HMMI) method. In training audio-visual HMMs, the model parameters {λ_av} can be chosen to optimize some criterion such as maximum likelihood. In inversion of audio-visual HMMs, visual parameters that optimize some criterion can be found based on given speech and model parameters {λ_av}. By using the proposed HMMI technique, an animated talking face can be synchronized with audio and can be driven realistically. The HMMI technique combined with MPEG-4 standard to create a virtual conference system, named VIRTUAL-FACE, is introduced to show the role of HMMI for applications of MPEG-4 facial animation.
机译:MPEG-4标准允许通过面部动画合成自然或合成视频。基于此标准,可以将动画面孔插入自然或合成视频中,以创建新的虚拟工作环境,例如虚拟会议或虚拟协作环境。对于这些应用程序,可以使用音频到视频转换技术来生成与语音同步的会说话的脸。在本文中,我们通过介绍一种新颖的隐马尔可夫模型反演(HMMI)方法来解决视听转换问题。在训练视听HMM中,可以选择模型参数{λ_av}来优化某些准则,例如最大似然。在视听HMM的反演中,可以基于给定的语音和模型参数{λ_av}找到优化某些标准的视觉参数。通过使用提出的HMMI技术,可以将动画说话的脸与音频同步,并可以实际驱动。引入了HMMI技术和MPEG-4标准,以创建一个名为VIRTUAL-FACE的虚拟会议系统,以展示HMMI在MPEG-4面部动画应用中的作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号