【24h】

Text-based Unstressed Syllable Prediction in Mandarin

机译:普通话的基于文本的非重读音节预测

获取原文

摘要

Recently, an increasing attention has been paid to Mandarin word stress which is important for improving the naturalness of speech synthesis. Most of the research on Mandarin speech synthesis focuses on three stress levels: stressed, regular and unstressed. This paper emphasizes the unstressed syllable prediction because the unstressed syllable is also important to the intelligibility of the synthetic speech. Similar as the prosodic structure, it is not easy to detect stress from text analysis due to the complicated context information. A method based on Classification and Regression Tree (CART) model has been proposed to predict the unstressed syllables with the high accuracy of 85%. The method has been finally applied into the TTS system. The experiment shows that the MOS score of synthetic speech has been improved by 0.35; the pitch contour of the new synthesized speech is also closer to natural speech.
机译:最近,人们越来越重视普通话单词的重音,这对于提高语音合成的自然性很重要。大多数关于普通话语音合成的研究都集中在三个压力水平上:压力,有规律和无压力。本文强调了非重读音节的预测,因为非重读音节对于合成语音的清晰度也很重要。与韵律结构相似,由于复杂的上下文信息,很难从文本分析中检测压力。提出了一种基于分类回归树(CART)模型的预测非重读音节的方法,其准确率高达85%。该方法已最终应用于TTS系统。实验表明,合成语音的MOS得分提高了0.35。新合成语音的音高轮廓也更接近自然语音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号