首页> 外文期刊>Journal of computer sciences >Towards the Development of Speaker-Dependent and Speaker-Independent Hidden Markov Model-Based Thai Speech Synthesis
【24h】

Towards the Development of Speaker-Dependent and Speaker-Independent Hidden Markov Model-Based Thai Speech Synthesis

机译:基于说话人和与说话人无关的隐马尔可夫模型的泰语语音合成的发展

获取原文
获取原文并翻译 | 示例
           

摘要

Problem statement: Tone distortion in Thai languages can deteriorate not only the intelligibility of speech but also its naturalness. Therefore, the correctness of tone must be carefully taken into account in continuous speech synthesis. The preliminary work confronted this problem when applying HMM-based speech synthesis to Thai. Approach: This study presented a study on speaker-dependent and speaker-independent Hidden Markov Model (HMM)-based Thai speech synthesis. In the speaker-dependent system, we developed a simple tone-separated tree structure in the tree-based context clustering process of the training stage to treat the tone distortion problem. In the speaker-independent system or averaged-voice-model system, a number of tonal features are extracted and applied with the Speaker Adaptive Training (SAT) and Shared Decision Tree (STC) techniques to release the tone distortion problem. Results: Our objective evaluation revealed that the proposed features could make the FO contour closer to the target speaker's real contour. The results from our subjective test also revealed that the proposed tonal features could improve the tone intelligibility of all speech-model scenarios of male and female. Conclusion: By applying our approach, the problem of tone distortion can be relieved effectively. The better tone correctness can improve the intelligibility and the naturalness of speech significantly.
机译:问题陈述:泰语中的音调失真不仅会降低语音的清晰度,还会降低其自然性。因此,在连续语音合成中必须仔细考虑音调的正确性。在将基于HMM的语音合成应用于泰语时,初步工作遇到了这个问题。方法:本研究提出了基于说话者依赖性和非说话者依赖性的隐马尔可夫模型(HMM)的泰语语音合成研究。在依赖说话者的系统中,我们在训练阶段的基于树的上下文聚类过程中开发了一个简单的音调分隔树结构,以处理音调失真问题。在独立于说话者的系统或平均语音模型系统中,提取了许多音调特征,并通过说话者自适应训练(SAT)和共享决策树(STC)技术加以应用,以消除音调失真问题。结果:我们的客观评估表明,所提出的功能可以使FO轮廓更接近目标说话人的真实轮廓。我们的主观测试结果还显示,建议的音调特征可以提高所有男性和女性语音模型场景的语调清晰度。结论:通过应用我们的方法,可以有效地解决音调失真的问题。更好的音调正确性可以显着提高语音的清晰度和自然度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号