2018 13th IEEE International Conference on Automatic Face & Gesture Recognition

Expressive Speech-Driven Lip Movements with Multitask Learning

Abstract

The orofacial area conveys a range of information, including speech articulation and emotions. These two factors impose constraints on the facial movements, creating non-trivial integrations and interplays. To generate more expressive and naturalistic movements for conversational agents (CAs), the relationship between these factors should be carefully modeled. Data-driven models are more appropriate for this task than rule-based systems. This paper presents two deep-learning speech-driven structures to integrate speech articulation and emotional cues. The proposed approaches rely on multitask learning (MTL) strategies, where related secondary tasks are jointly solved when synthesizing orofacial movements. In particular, we evaluate emotion recognition and viseme recognition as secondary tasks. The approach creates shared representations that generate behaviors that are not only closer to the original orofacial movements but are also perceived as more natural than the results from single-task learning.
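As a concrete illustration of the MTL strategy described in the abstract, the sketch below pairs a shared speech encoder with a primary head that regresses lip/orofacial movement parameters and auxiliary heads for the two secondary tasks, emotion recognition and viseme recognition. This is a minimal Python/PyTorch sketch, not the paper's architecture: the bidirectional-LSTM encoder, every feature and label dimension, and the loss weights are illustrative assumptions.

# Illustrative sketch only -- not the paper's implementation. The encoder,
# head sizes, label inventories, and loss weights below are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SpeechDrivenMTL(nn.Module):
    """Shared speech encoder with a primary lip-movement regression head
    and two secondary heads (emotion and viseme recognition)."""
    def __init__(self, n_acoustic=40, hidden=256,
                 n_lip_params=18, n_emotions=4, n_visemes=14):
        super().__init__()
        # Shared representation over the acoustic feature sequence.
        self.encoder = nn.LSTM(n_acoustic, hidden, num_layers=2,
                               batch_first=True, bidirectional=True)
        feat = 2 * hidden
        self.lip_head = nn.Linear(feat, n_lip_params)    # primary: per-frame regression
        self.viseme_head = nn.Linear(feat, n_visemes)    # secondary: per-frame labels
        self.emotion_head = nn.Linear(feat, n_emotions)  # secondary: per-utterance label

    def forward(self, x):
        # x: (batch, time, n_acoustic) acoustic features, e.g. MFCCs.
        h, _ = self.encoder(x)                       # (batch, time, 2*hidden)
        lips = self.lip_head(h)                      # (batch, time, n_lip_params)
        visemes = self.viseme_head(h)                # (batch, time, n_visemes)
        emotion = self.emotion_head(h.mean(dim=1))   # (batch, n_emotions)
        return lips, visemes, emotion

def mtl_loss(outputs, targets, w_viseme=0.1, w_emotion=0.1):
    """Joint objective: synthesis loss plus weighted secondary-task losses,
    so the shared encoder absorbs articulation and emotion cues."""
    lips, visemes, emotion = outputs
    lip_t, viseme_t, emotion_t = targets
    loss = F.mse_loss(lips, lip_t)                                      # primary task
    loss = loss + w_viseme * F.cross_entropy(visemes.transpose(1, 2), viseme_t)
    loss = loss + w_emotion * F.cross_entropy(emotion, emotion_t)
    return loss

In a setup like this, only the lip head would be used at synthesis time; the secondary heads exist to shape the shared representation during training, which is how jointly solved auxiliary tasks can pull the generated movements toward the more natural, emotion-aware behavior the abstract reports.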
