Towards a high quality Finnish talking head

机译：迈向高品质的芬兰会说话的人

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe how our Finnish talking head was improved by using a new auditory speech synthesis method based on neural networks and optimal synchronization of the facial speech animation and the audio signal. In our first version of the talking head, the user typed in text and synthesized auditory speech and synchronized facial animation were created automatically. We combine a 3D facial model with a commercial auditory text-to-speech synthetizer (TTS). The auditory speech is produced by concatenating pre-recorded samples of natural speech according to a set of rules. The quality of the current speech synthesis is not yet adequate. A new strategy has been developed to improve the TTS and to integrate auditory synthesizer synchronization, especially when hardware capabilities are limited. We are developing a new method to achieve an optimal synchronization, independent of the platform used. This method is based on predictive visual synthesis. The new synchronization method gives us better control over audio-visual speech synthesis in the time domain. Using the diphone duration, we can use a more realistic interpolation function between the visemes. Thus, we can also take into account coarticulation effects.

机译：我们描述了如何使用一种新的基于神经网络的听觉语音合成方法以及面部语音动画和音频信号的最佳同步来改善我们的芬兰人的头部。在我们的第一个版本的会说话的头中，用户键入文本并自动创建合成的听觉语音和同步的面部动画。我们将3D面部模型与商业听觉语音合成器（TTS）结合在一起。听觉语音是通过根据一组规则将预先录制的自然语音样本连接起来而产生的。当前语音合成的质量还不够。已经开发出一种新的策略来改善TTS并集成听觉合成器同步，尤其是在硬件功能有限的情况下。我们正在开发一种新的方法，以实现最佳同步，而与所使用的平台无关。此方法基于预测性视觉综合。新的同步方法使我们可以更好地控制时域中的视听语音合成。使用diphone持续时间，我们可以在视位之间使用更逼真的插值函数。因此，我们也可以考虑协同发音的影响。

著录项

来源
《》|1999年|P.433-437|共5页
会议地点
作者
Olives; J.-L.; Sams; M.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术 ;
关键词

相似文献

外文文献
中文文献
专利

1. Talking heads or talking eyes? Effects of head orientation and sudden onset gaze cues on attention capture [J] . van der Wel Robrecht P., Welsh Timothy, Boeckler Anne Attention, perception & psychophysics . 2018 ,第1期

机译：说话头或说话的眼睛？头定位和突发发作凝视提示对注意捕捉的影响
2. Stalk length should be considered for storage quality of broccoli heads based on the investigation of endogenous hormones metabolism [J] . Scientia horticulturae . 2020 ,第期

机译：基于对内源激素新陈代谢的调查，应考虑秸秆长度进行硬甘蓝头的储存质量
3. Stalk length affects the mineral distribution and floret quality of broccoli ( ce:italic>Brassica oleracea/ce:italic> L. var. ce:italic>italica/ce:italic>) heads during storage [J] . Yanyin Guo, Liang Wang, Yong Chen, Postharvest Biology and Technology . 2018 ,第期

机译：秆长度影响西兰花的矿物分布和小花质质量（＆ ce：斜体>芸苔＆ / ce：斜体> l.var。＆ ce：斜体> italica＆ / ce：斜体>）头部
4. Towards a high quality Finnish talking head [C] . Jean-Luc Olives, Mikko Sams, Janne Kulju, IEEE Workshop on Multimedia Signal Processing . 1999

机译：走向高质量的芬兰谈话
5. A framework for automatic creation of talking heads for multimedia applications. [D] . Choi, KyoungHo. 2002

机译：自动创建多媒体应用程序的讲话头的框架。
6. From Talking Heads to Talking Students [O] . Lesley Evans Ogden -1

机译：从会说话的人到会说话的学生
7. HIGH QUALITY LIP-SYNC ANIMATION FOR 3D PHOTO-REALISTIC TALKING HEAD [O] . Lijuan Wang, Wei Han, Frank K. Soong 2014

机译：三维光学真实测量头的高质量唇同步动画

Towards a high quality Finnish talking head

摘要

著录项

相似文献

相关主题

期刊订阅