首页> 外文会议>2014 IEEE International Conference on Orange Technologies >Towards realizing gesture-to-speech conversion with a HMM-based bilingual speech synthesis system
【24h】

Towards realizing gesture-to-speech conversion with a HMM-based bilingual speech synthesis system

机译:借助基于HMM的双语语音合成系统来实现手势到语音的转换

获取原文
获取原文并翻译 | 示例

摘要

This paper realizes a gesture-to-speech conversion system to solve the communication problem between healthy people and speech disorders. An improved speeded up robust features (SURF) algorithm is adopted for static gesture recognition by combining Kinect sensor. Meanwhile, a Hidden Markov Model (HMM) based Mandarin-Tibetan bilingual speech synthesis system is developed by using speaker adaptive training. A set of semantic rules is designed for the static gestures. Chinese or Tibetan context-dependent labels of recognized static gestures are generated according to the semantic rules. The recognized gestures are finally converted to the Mandarin or Tibetan by using the Mandarin-Tibetan bilingual speech synthesis system with the context-dependent labels. Tests show that the static gesture recognition rate of the designed system achieves 97.1%. Subjective evaluation demonstrates that synthesized speech can get 4.0 of the mean opinion score (MOS) on synthesized speech.
机译:本文实现了一种手势到语音转换系统,以解决健康人与言语障碍之间的交流问题。结合Kinect传感器,采用改进的快速鲁棒特征(SURF)算法进行静态手势识别。同时,通过说话人自适应训练,开发了基于隐马尔可夫模型(HMM)的汉语-藏语双语语音合成系统。为静态手势设计了一组语义规则。根据语义规则生成已识别静态手势的中文或藏文上下文相关标签。最终,通过使用带有上下文相关标签的普通话-藏语双语语音合成系统,将识别出的手势转换为普通话或藏语。测试表明,所设计系统的静态手势识别率达到97.1%。主观评估表明,合成语音可以在合成语音上获得4.0的平均意见得分(MOS)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号