Towards realizing gesture-to-speech conversion with a HMM-based bilingual speech synthesis system

Abstract

This paper presents a gesture-to-speech conversion system intended to ease communication between healthy people and people with speech disorders. An improved speeded up robust features (SURF) algorithm, combined with a Kinect sensor, is adopted for static gesture recognition. A Hidden Markov Model (HMM) based Mandarin-Tibetan bilingual speech synthesis system is developed using speaker adaptive training. A set of semantic rules is designed for the static gestures, and Chinese or Tibetan context-dependent labels are generated for each recognized gesture according to these rules. The bilingual synthesis system then converts the recognized gestures to Mandarin or Tibetan speech from the context-dependent labels. Tests show that the designed system achieves a static gesture recognition rate of 97.1%. Subjective evaluation shows that the synthesized speech achieves a mean opinion score (MOS) of 4.0.
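As a reading aid for the MOS figure above, the sketch below shows how a mean opinion score is conventionally computed: listeners rate each utterance on a 1 (bad) to 5 (excellent) scale and the ratings are averaged. The function name and the ratings are hypothetical and are not taken from the paper, whose listening-test protocol is not described in this record.

```python
from statistics import mean


def mean_opinion_score(ratings):
    """Average listener ratings given on the 1 (bad) to 5 (excellent) scale."""
    if not ratings:
        raise ValueError("at least one rating is required")
    if any(r < 1 or r > 5 for r in ratings):
        raise ValueError("ratings must lie between 1 and 5")
    return mean(ratings)


# Hypothetical ratings from five listeners for one synthesized utterance;
# these values are illustrative only and are not data from the paper.
example_ratings = [4, 4, 5, 3, 3]
print(f"MOS = {mean_opinion_score(example_ratings):.1f}")
```

A reported system-level MOS would normally average over many utterances and listeners; the single-utterance case here only illustrates the arithmetic.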

Bibliographic record

Source
2014 IEEE International Conference on Orange Technologies | 2014 | pp. 97-100 | 4 pages
Conference location: Xian (CN)
Authors
Hongwu Yang; Xiaochun An; Dong Pei; Yitong Liu
Affiliation
Coll. of Phys. Electron. Eng., Northwest Normal Univ., Lanzhou, China
Original format: PDF
Language: English
Keywords
feature extraction; gesture recognition; handicapped aids; hidden Markov models; natural language processing; sensors; speaker recognition; speech synthesis; Chinese context-dependent labels; HMM based Mandarin-Tibetan bilingual speech synthesis system; Kinect sensor; MOS; SURF algorithm; Tibetan context-dependent labels; communication problem; gesture-to-speech conversion system; healthy people; hidden Markov model; mean opinion score; semantic rules; speaker adaptive training; speech disorders; speeded up robust fe;
Date indexed: 2022-08-26 13:54:02

