首页> 外文会议>37th Annual Conference on IEEE Industrial Electronics Society >Speech synchronization between speech and lip shape movements for service robotics applications

【24h】

Speech synchronization between speech and lip shape movements for service robotics applications

机译：服务机器人应用中语音和嘴唇形状运动之间的语音同步

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Synchronization between speech and mouth shape includes technologies, such as computer vision, speech synthesis, and speech recognition. We present a method to synchronize the image and the speech, and we use Microsoft's Speech Application Programming Interface (SAPI) to be the speech synthesis tool. Speech animation includes two components, the speech and the image. Speech synthesis output is obtained from Text-to-Speech (TTS), and the images of visemes are generated from software, FaceGen Modeller. Import three key pictures to this software to calibrate and generate the face model. The viseme event handler in C# will connect the image of mouth shape and viseme together. Load the images sequentially and the visemes will one by one match with the images correctly. The main applications of speech synthesis are used as assistive devices, e.g. the use of screen readers for people with visual impairment. A mute person can take advantage of this technology to talk to others. In recent years, speech synthesis is extensively applied in service robotics and entertainment productions such as language learning, education, video games, animations, and music videos.

机译：语音和嘴形之间的同步包括诸如计算机视觉，语音合成和语音识别之类的技术。我们提出了一种同步图像和语音的方法，并且我们使用Microsoft的语音应用程序编程接口（SAPI）作为语音合成工具。语音动画包括两个部分，语音和图像。语音合成输出是从文本语音转换（TTS）获得的，而视位影音的图像是从FaceGen Modeller软件生成的。将三个关键图片导入该软件以进行校准并生成人脸模型。 C＃中的Viseme事件处理程序会将嘴形和Viseme的图像连接在一起。依次加载图像，视位将与图像正确地一一匹配。语音合成的主要应用被用作辅助设备，例如语音合成。视力障碍者使用屏幕阅读器。静音的人可以利用此技术与他人交谈。近年来，语音合成已广泛应用于服务机器人和娱乐产品，例如语言学习，教育，视频游戏，动画和音乐视频。

著录项

来源
《37th Annual Conference on IEEE Industrial Electronics Society 》|2011年|p.2261-2266|共6页
会议地点
作者
Luo Ren C.; Chien-Chieh Huang; Shu-Ruei Chang; Yi-Jeng Tsai;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类真空电子技术 ;
关键词
facial animation; speech synthesis; synchronization; text-to-speech; viseme;

机译：面部动画;语音合成;同步;文本到语音;粘滞;

相似文献

外文文献
中文文献
专利

1. Time-delay neural networks for estimating lip movements from speech analysis: a useful tool in audio-video synchronization [J] . Lavagetto F. IEEE Transactions on Circuits and Systems for Video Technology . 1997 ,第5期

机译：时延神经网络，可从语音分析中估计唇部运动：音视频同步中的有用工具
2. Observation-execution matching and action inhibition in human primary motor cortex during viewing of speech-related lip movements or listening to speech. [J] . Murakami T, Restle J, Ziemann U Neuropsychologia . 2011 ,第7期

机译：在查看与语音相关的唇部运动或听语音期间，人类初级运动皮层中的观察执行匹配和动作抑制。
3. The fusion of visual lip movements and mixed speech signals for robust speech separation [J] . Parham Aarabi, Bob Mungamuru Information Fusion . 2004 ,第2期

机译：融合视觉唇动和混合语音信号以实现可靠的语音分离
4. Speech synchronization between speech and lip shape movements for service robotics applications [C] . Luo Ren C., Chien-Chieh Huang, Shu-Ruei Chang, Annual Conference on IEEE Industrial Electronics Society . 2011

机译：用于服务机器人应用的语音与唇形运动之间的语音同步
5. The role of prosodic stress and speech perturbation on the temporal synchronization of speech and deictic gestures. [D] . Rusiewicz, Heather Leavy. 2010

机译：韵律重音和语音扰动在语音和惯性手势的时间同步中的作用。
6. A Geometric Morphometric Approach to the Analysis of Lip Shape during Speech: Development of a Clinical Outcome Measure [O] . Hashmat Popat, Stephen Richmond, Alexei I. Zhurov, 2010

机译：语音中唇形分析的几何形态计量学方法：临床结果量度的发展
7. Speech-Video Synchronization Using Lips Movements and Speech Envelope Correlation [O] . Amar A. El-sallam, Ajmal S. Mian 2010

机译：使用嘴唇运动和语音包络相关的语音-视频同步

Speech synchronization between speech and lip shape movements for service robotics applications

摘要

著录项

相似文献

相关主题

期刊订阅