3D Realistic Talking Face Co-Driven by Text and Speech

机译：3d由文本和讲话协作的现实谈的脸

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

To create 3D realistic talking face has been a challenge for a long time. Previous works emphasize text or speech driven talking face respectively while the animation result is not very realistic or natural- looking. In the proposed approach, text and speech are considered to drive the 3D talkingface coordinately. The text is translated into a sequence of visemes' transcription. And time vector of the sequence is extracted from the speech corresponding to the text after it is segmented into phonetic sequence. A muscle based viseme vector is defined for static viseme. And then, with the time vector and the static visemes's sequence, dynamic visemes are generated through time-related dominance function. Finally, according to the frame rate to be rendered, intermediate frames are interpolated between key frames to make the animation result looks more natural and realistic than those obtained based on the text or speech-driven only.

机译：创建3D现实谈话脸一直是挑战。以前的作品分别强调文本或语音驱动的讨论脸，而动画结果不是很现实或自然的。在所提出的方法中，文本和语音被认为是协调的方式驾驶3D谈话。该文本被翻译成一系列鼠标转录。在将其分段为语音序列之后，从对应于文本的语音中提取序列的时间向量。肌肉基因载体被定义为静态粘性。然后，通过时间向量和静态Visemes序列，通过时间相关的优势函数生成动态探测。最后，根据要渲染的帧速率，在关键帧之间插值中间帧以使动画结果看起来比基于文本或语音驱动的那些更自然和现实。

著录项

来源
《IEEE Interantional Conference on Systems, Man, and Cybernetics》|2003年||共6页
会议地点
作者
Mingli Song; Chun Chen; Jiajun Bu; Ronghua Liang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP13-53;
关键词
Visemes' transcription; Speech segmentation; Time vector extraction; Static viseme; Dynamic visemes generation;

机译：探视转录;语音分割;时间向量提取;静态视野;动态粘性生成;

相似文献

外文文献
中文文献
专利

1. A statistical parametric approach to video-realistic text-driven talking avatar [J] . Lei Xie, Naicai Sun, Bo Fan Multimedia Tools and Applications . 2014,第1期

机译：统计参数化方法，用于视频逼真的文本驱动的说话头像
2. Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling [J] . Lei Xie, Zhi-Qiang Liu IEEE transactions on multimedia . 2007,第期

机译：使用发音模型对语音驱动的说话人脸进行逼真的嘴部同步
3. Texts for Talking: Evaluation of a Mobile Health Program Addressing Speech and Language Delay [J] . Olson Kaitlyn B., Wilkinson Carol L., Wilkinson M. Jackson, Clinical Pediatrics . 2016,第11期

机译：谈话文本：评估针对语音和语言延迟的移动健康计划
4. 3D realistic talking face co-driven by text and speech [C] . Mingli Song, Chun Chen, Jiajun Bu, . 2003

机译：文字和语音共同驱动的3D现实说话面孔
5. Right to Be Forgotten or Right to Not Be Talked About? Public and Private Speech Regulation and the Panic About Critical Speech on the Interactive Web. [D] . Medeiros, Benjamin A. 2016

机译：被遗忘的权利或未被谈论的权利？交互式网络上的公共和私人语音监管以及关于批评性语音的恐慌。
6. Visual Speech Benefit in Clear and Degraded Speech Depends on the Auditory Intelligibility of the Talker and the Number of Background Talkers [O] . Catherine L. Blackburn, Pádraig T. Kitterick, Gary Jones, 2019

机译：清晰语音中的视觉语音收益取决于讲话者的听觉清晰度和背景讲话者的数量
7. Image Talk: A Real Time Synthetic Talking Head Using One Single Image with Chinese Text-To-Speech Capability [O] . Woei-luen Perng, Yungkang Wu, Ming Ouhyoung 1998

机译：Image Talk：一个具有中文文本语音能力的单一图像的实时合成会说话头

3D Realistic Talking Face Co-Driven by Text and Speech

摘要

著录项

相似文献

相关主题

期刊订阅