首页> 外国专利> Photo-realistic synthesis of image sequences with lip movements synchronized with speech

Photo-realistic synthesis of image sequences with lip movements synchronized with speech

机译:嘴唇运动与语音同步的图像序列的逼真合成

摘要

Audiovisual data of an individual reading a known script is obtained and stored in an audio library and an image library. The audiovisual data is processed to extract feature vectors used to train a statistical model. An input audio feature vector corresponding to desired speech with which a synthesized image sequence will be synchronized is provided. The statistical model is used to generate a trajectory of visual feature vectors that corresponds to the input audio feature vector. These visual feature vectors are used to identify a matching image sequence from the image library. The resulting sequence of images, concatenated from the image library, provides a photorealistic image sequence with lip movements synchronized with the desired speech.
机译:获得阅读已知脚本的个人的视听数据,并将其存储在音频库和图像库中。处理视听数据以提取用于训练统计模型的特征向量。提供与期望的语音相对应的输入音频特征向量,合成图像序列将与该期望的语音同步。统计模型用于生成与输入音频特征向量相对应的视觉特征向量的轨迹。这些视觉特征向量用于从图像库中识别匹配的图像序列。从图像库连接的结果图像序列提供了逼真的图像序列,其唇部运动与所需语音同步。

著录项

  • 公开/公告号US9728203B2

    专利类型

  • 公开/公告日2017-08-08

    原文格式PDF

  • 申请/专利权人 LIJUAN WANG;FRANK SOONG;

    申请/专利号US201113098488

  • 发明设计人 LIJUAN WANG;FRANK SOONG;

    申请日2011-05-02

  • 分类号G10L21/10;

  • 国家 US

  • 入库时间 2022-08-21 13:42:26

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号