This paper summarizes our work at Queen's University and ATR Laboratories on cross-modal speech perception and production. Our approach has been to study these two sides of speech together and to use multi-modal speech production data to parameterize and control audiovisual animation systems. Two approaches to production-based facial animation have been pursued: one statistical and the other physical. In both cases, realistic talking-head animations are generated from continuous input of production data. The statistical animation method of AV synthesis extends our multi-linear techniques, developed for the analysis of orofacial motion and speech acoustics, to include the correlation between measured 3D positions on the face and deformation coefficients of the facial surface. In the physical approach, the dynamic form of the animation is determined by the biophysical characteristics of the animated object. The physical model consists of multiple structural layers: model skull and jaw surfaces, an orofacial muscle layer, and a three-layer polygon model of the soft tissue. In a series of studies using these animation approaches, we have examined the conditions under which speech perception in noise is enhanced by simultaneous visual presentation. Our data show a distinction between visual prosody and segmental perception, as well as demonstrating that our animated stimuli produce natural increases in speech intelligibility.
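As a minimal sketch of the statistical idea described above — correlating measured 3D facial positions with deformation coefficients via a linear mapping — the following Python example fits one block of such a model with least squares. All names, dimensions, and the synthetic data are illustrative assumptions, not the paper's actual parameterization.

```python
# Hypothetical sketch: learn a linear map from flattened 3D marker
# positions to facial-surface deformation coefficients, then use it
# to drive animation coefficients from a new frame of marker data.
import numpy as np

rng = np.random.default_rng(0)

n_frames, n_markers, n_coeffs = 200, 18, 10          # assumed sizes
X = rng.normal(size=(n_frames, n_markers * 3))        # marker positions per frame
W_true = rng.normal(size=(n_markers * 3, n_coeffs))   # synthetic ground-truth map
Y = X @ W_true + 0.01 * rng.normal(size=(n_frames, n_coeffs))  # noisy coefficients

# Least-squares estimate of the linear mapping (one mode of a multi-linear model)
W, *_ = np.linalg.lstsq(X, Y, rcond=None)

# Predict deformation coefficients for an unseen frame of marker data
x_new = rng.normal(size=(1, n_markers * 3))
coeffs = x_new @ W
print(coeffs.shape)  # one row of n_coeffs deformation coefficients
```

In practice such a mapping would be estimated from recorded production data rather than synthetic arrays, and extended across modes (motion, acoustics) in the multi-linear framework the abstract mentions.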