首页> 外文期刊>IEEE transactions on visualization and computer graphics >Accurate visible speech synthesis based on concatenating variable length motion capture data
【24h】

Accurate visible speech synthesis based on concatenating variable length motion capture data

机译:基于级联可变长度运动捕获数据的准确可见语音合成

获取原文
获取原文并翻译 | 示例
       

摘要

We present a novel approach to synthesizing accurate visible speech based on searching and concatenating optimal variable-length units in a large corpus of motion capture data. Based on a set of visual prototypes selected on a source face and a corresponding set designated for a target face, we propose a machine learning technique to automatically map the facial motions observed on the source face to the target face. In order to model the long distance coarticulation effects in visible speech, a large-scale corpus that covers the most common syllables in English was collected, annotated and analyzed. For any input text, a search algorithm to locate the optimal sequences of concatenated units for synthesis is described. A new algorithm to adapt lip motions from a generic 3D face model to a specific 3D face model is also proposed. A complete, end-to-end visible speech animation system is implemented based on the approach. This system is currently used in more than 60 kindergartens through third grade classrooms to teach students to read using a lifelike conversational animated agent. To evaluate the quality of the visible speech produced by the animation system, both subjective evaluation and objective evaluation are conducted. The evaluation results show that the proposed approach is accurate and powerful for visible speech synthesis.
机译:我们提出了一种新颖的方法,该方法可以基于大量运动捕获数据中的最佳可变长度单位进行搜索和级联来合成准确的可见语音。基于在源面部上选择的一组视觉原型以及为目标面部指定的一组对应的原型,我们提出了一种机器学习技术,可将在源面部上观察到的面部运动自动映射到目标面部。为了模拟可见语音中的长距离发音效果,收集,注释和分析了涵盖英语中最常见音节的大型语料库。对于任何输入文本,都描述了一种搜索算法,用于定位用于合成的串联单元的最佳序列。还提出了一种将嘴唇运动从通用3D面部模型适配到特定3D面部模型的新算法。基于该方法,实现了一个完整的,端到端的可见语音动画系统。目前,该系统已在60多家幼儿园到三年级教室中使用,教学生使用逼真的对话动画代理进行阅读。为了评估动画系统产生的可见语音的质量,进行了主观评估和客观评估。评估结果表明,该方法对可见语音合成准确,有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号