Photo-realistic synthesis of image sequences with lip movements synchronized with speech
Abstract
Audiovisual data of an individual reading a known script is obtained and stored in an audio library and an image library. The audiovisual data is processed to extract feature vectors used to train a statistical model. An input audio feature vector corresponding to desired speech with which a synthesized image sequence will be synchronized is provided. The statistical model is used to generate a trajectory of visual feature vectors that corresponds to the input audio feature vector. These visual feature vectors are used to identify a matching image sequence from the image library. The resulting sequence of images, concatenated from the image library, provides a photorealistic image sequence with lip movements synchronized with the desired speech.
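The pipeline described above (train a model on paired audio and visual features, predict a visual trajectory for new audio, then pick matching frames from the image library) can be illustrated with a minimal sketch. The abstract does not say which statistical model is used, so this sketch swaps in an ordinary least-squares linear map purely as a stand-in, and it selects frames by nearest-neighbor distance, which is also an assumption. All function names, feature dimensions, and the synthetic data are hypothetical.

```python
import numpy as np


def _with_bias(feats):
    """Append a constant column so the linear map can learn an offset."""
    return np.hstack([feats, np.ones((len(feats), 1))])


def train_audio_to_visual(audio_feats, visual_feats):
    """Fit a least-squares map from audio to visual feature vectors.

    Stand-in (assumption) for the unspecified statistical model.
    """
    W, *_ = np.linalg.lstsq(_with_bias(audio_feats), visual_feats, rcond=None)
    return W


def predict_visual_trajectory(W, audio_feats):
    """Generate one visual feature vector per input audio frame."""
    return _with_bias(audio_feats) @ W


def select_frames(trajectory, library_visual_feats):
    """Return, for each trajectory point, the index of the closest library image.

    Nearest-neighbor matching is an assumption; the source only says that a
    matching image sequence is identified from the image library.
    """
    diffs = trajectory[:, None, :] - library_visual_feats[None, :, :]
    return (diffs ** 2).sum(axis=2).argmin(axis=1)


# Synthetic stand-in for the audio and image libraries (hypothetical data):
# 50 frames, 4-dimensional audio features, 3-dimensional visual features.
rng = np.random.default_rng(0)
audio_library = rng.normal(size=(50, 4))
image_library_feats = audio_library @ rng.normal(size=(4, 3)) + 0.5

W = train_audio_to_visual(audio_library, image_library_feats)
trajectory = predict_visual_trajectory(W, audio_library[:10])
frame_ids = select_frames(trajectory, image_library_feats)
```

Here `frame_ids` indexes into the image library; concatenating the images at those indices would give the output sequence. A real system would likely also need to enforce smoothness between consecutive selected frames, which this sketch ignores.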