首页> 外文会议>37th Annual Conference on IEEE Industrial Electronics Society >Speech synchronization between speech and lip shape movements for service robotics applications
【24h】

Speech synchronization between speech and lip shape movements for service robotics applications

机译:服务机器人应用中语音和嘴唇形状运动之间的语音同步

获取原文

摘要

Synchronization between speech and mouth shape includes technologies, such as computer vision, speech synthesis, and speech recognition. We present a method to synchronize the image and the speech, and we use Microsoft's Speech Application Programming Interface (SAPI) to be the speech synthesis tool. Speech animation includes two components, the speech and the image. Speech synthesis output is obtained from Text-to-Speech (TTS), and the images of visemes are generated from software, FaceGen Modeller. Import three key pictures to this software to calibrate and generate the face model. The viseme event handler in C# will connect the image of mouth shape and viseme together. Load the images sequentially and the visemes will one by one match with the images correctly. The main applications of speech synthesis are used as assistive devices, e.g. the use of screen readers for people with visual impairment. A mute person can take advantage of this technology to talk to others. In recent years, speech synthesis is extensively applied in service robotics and entertainment productions such as language learning, education, video games, animations, and music videos.
机译:语音和嘴形之间的同步包括诸如计算机视觉,语音合成和语音识别之类的技术。我们提出了一种同步图像和语音的方法,并且我们使用Microsoft的语音应用程序编程接口(SAPI)作为语音合成工具。语音动画包括两个部分,语音和图像。语音合成输出是从文本语音转换(TTS)获得的,而视位影音的图像是从FaceGen Modeller软件生成的。将三个关键图片导入该软件以进行校准并生成人脸模型。 C#中的Viseme事件处理程序会将嘴形和Viseme的图像连接在一起。依次加载图像,视位将与图像正确地一一匹配。语音合成的主要应用被用作辅助设备,例如语音合成。视力障碍者使用屏幕阅读器。静音的人可以利用此技术与他人交谈。近年来,语音合成已广泛应用于服务机器人和娱乐产品,例如语言学习,教育,视频游戏,动画和音乐视频。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号