首页> 外文期刊>American journal of applied sciences >Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis
【24h】

Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis

机译:改善基于平均语音的泰语语音合成的音质清晰度

获取原文
获取原文并翻译 | 示例
           

摘要

Problem statement: Tone intelligibility in speech synthesis is an important attribute that should be taken into account. The tone correctness of the synthetic speech is degraded considerably in the average-voice-based HMM-based Thai speech synthesis. The tying mechanism in the decision tree based context clustering without appropriate criterion causes unexpected tone neutralization. Incorporation of the phrase intonation to the context clustering process in the training stage was proposed early. However, the tone correctness is not satisfied. Approach: This study proposes a number of tonal features including tone-geometrical features and phrase intonation features to be exploited in the context clustering process of HMM training stage. Results: In the experiments, subjective evaluations of both average voice and adapted voice in terms of the intelligibility of tone are conducted. Effects on decision trees of the extracted features are also evaluated. By considering gender in training speech, two core experiments were conducted. The first experiment shows that the proposed tonal features can improve the tone intelligibility for female speech model above that of male speech model, while the second experiment shows that the proposed tonal features give the better improvement of the tone intelligibility for gender dependent model than for gender independent model. Conclusion: All of the experimental results confirm that the tone correctness of the synthesized speech from the average-voice-based HMM-based Thai speech synthesis is significantly improved when using most of the extracted features.
机译:问题陈述:语音合成中的语音清晰度是应考虑的重要属性。在基于平均语音的基于HMM的泰语语音合成中,合成语音的音调正确性大大降低。没有适当条件的基于决策树的上下文聚类中的绑定机制会导致意外的色调中和。提早在训练阶段将短语语调并入上下文聚类过程中。但是,不满足色调正确性。方法:本研究提出了许多音调特征,包括音调几何特征和短语语调特征,这些特征将在HMM训练阶段的上下文聚类过程中加以利用。结果:在实验中,对平均声音和适应声音的主观评价均基于音调的清晰度进行。还评估了提取特征对决策树的影响。通过在培训演讲中考虑性别,进行了两个核心实验。第一个实验表明,所提出的音调特征可以提高女性言语模型的音调清晰度,而第二个实验表明,所提出的音调特征对于性别依赖性模型的音调清晰度要比对性别的模型更好。独立模型。结论:所有实验结果都证实,使用大多数提取的特征时,基于平均语音的基于HMM的泰语语音合成的合成语音的正确性得到了显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号