This paper describes a transmodal mapping from audio speech to talking faces based on hidden Markov models (HMMs). If facial movements can be synthesized well enough for natural communication, human-machine communication stands to benefit greatly. The paper presents an HMM-based speech-driven lip movement synthesis method, its improvement through audio-visual joint estimation, and its extension to talking face generation. Evaluation experiments show that the proposed method generates natural and accurate talking faces from audio speech input.
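To give an intuition for the transmodal mapping, the sketch below shows a toy discrete HMM (not the paper's actual model, whose states, features, and parameters are all hypothetical here): hidden states stand for lip shapes (visemes), observations stand for quantized audio features, and Viterbi decoding recovers the most likely lip-shape sequence from the audio sequence.

```python
# Toy sketch of HMM-based speech-to-lip mapping. All states, symbols,
# and probabilities below are made up for illustration only.

STATES = ["closed", "open", "rounded"]   # hypothetical visemes
OBS = ["silence", "vowel_a", "vowel_o"]  # hypothetical quantized audio symbols

start = {"closed": 0.6, "open": 0.2, "rounded": 0.2}
trans = {
    "closed":  {"closed": 0.5, "open": 0.3, "rounded": 0.2},
    "open":    {"closed": 0.3, "open": 0.5, "rounded": 0.2},
    "rounded": {"closed": 0.3, "open": 0.2, "rounded": 0.5},
}
emit = {
    "closed":  {"silence": 0.8, "vowel_a": 0.1, "vowel_o": 0.1},
    "open":    {"silence": 0.1, "vowel_a": 0.8, "vowel_o": 0.1},
    "rounded": {"silence": 0.1, "vowel_a": 0.1, "vowel_o": 0.8},
}

def viterbi(observations):
    """Return the most likely viseme sequence for an audio observation sequence."""
    # delta[s] = probability of the best path ending in state s so far
    delta = {s: start[s] * emit[s][observations[0]] for s in STATES}
    back = []  # backpointers, one dict per time step after the first
    for o in observations[1:]:
        prev = delta
        delta, ptr = {}, {}
        for s in STATES:
            best_prev = max(STATES, key=lambda p: prev[p] * trans[p][s])
            delta[s] = prev[best_prev] * trans[best_prev][s] * emit[s][o]
            ptr[s] = best_prev
        back.append(ptr)
    # Trace the best path backwards from the most likely final state.
    state = max(STATES, key=lambda s: delta[s])
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    return list(reversed(path))

print(viterbi(["silence", "vowel_a", "vowel_o"]))
# → ['closed', 'open', 'rounded']
```

In a real system the discrete symbols would be replaced by continuous acoustic features (e.g. with Gaussian-mixture emissions), and the decoded state sequence would drive lip-shape parameters rather than labels.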