首页> 外文会议>International Symposium on Tonal Aspects of Languages: Emphasis on Tone Languages >Prosodic Word Boundaries Prediction for Mandarin Text-to-Speech
【24h】

Prosodic Word Boundaries Prediction for Mandarin Text-to-Speech

机译:汉语普通话到语音的韵律词边界预测

获取原文

摘要

In Mandarin speech, the Prosodic Word (PW) is the basic rhythmic unit instead of Lexical Word (LW), and the naturalness of TTS will be directly influenced by the segmentation of PW. Most of the PWs are the combination of some LWs. In this paper, three models, i.e. a directed acyclic graph (DAG) model, segmentation model and Markov Model (MM) combined with Transformation-Based Error Driven (TBED) learning algorithm are designed to combine lexical words into prosodic words. Considering some long LWs should be broken into two or more PWs, a long word break model is also applied to those LWs. Experimental results show that MM combined with TBED plus a long word break model is the best one among the three methods, and 93.00% precision and 93.23% recall are achieved.
机译:在普通话中,韵律词(PW)是基本的韵律单元,而不是词汇词(LW),而TTS的自然性将直接受到PW分段的影响。大多数PW是某些LW的组合。本文设计了三种模型,即有向无环图(DAG)模型,分段模型和马尔可夫模型(MM)结合基于变换的错误驱动(TBED)学习算法,将词汇词组合为韵律词。考虑到一些长的LW应该分成两个或更多的PW,因此长字中断模型也适用于那些LW。实验结果表明,MM与TBED结合长单词中断模型相结合是三种方法中最好的一种,并且达到了93.00%的准确率和93.23%的查全率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号