首页> 外文会议>International Conference on Advanced Science and Technology >Chinese Prosody Generation Based on C-ToBI Representation for Text-To-Speech
【24h】

Chinese Prosody Generation Based on C-ToBI Representation for Text-To-Speech

机译:基于C-Tobi表示的中国韵律发电

获取原文

摘要

Prosody modeling is critical in developing text-to-speech (TTS) systems where speech synthesis is used to automatically generate natural speech. In this paper, we present a prosody generation architecture based on Chinese Tone and Break Index (C-ToBI) representation. ToBI is a multi-tier representation system based on linguistic knowledge to transcribe events in an utterance. The TTS system which adopts ToBI as an intermediate representation is known to exhibit higher flexibility, modularity and domain/task portability compared with the direct prosody generation TTS systems. We model Chinese prosody generation as a classification problem and apply conditional Maximum Entropy (ME) classification to this problem. We empirically verify the usefulness of various natural language and phonology features to make well-integrated features for ME framework.
机译:韵律建模对于开发语音合成用于自动产生自然语音的文本到语音(TTS)系统至关重要。在本文中,我们提出了一种基于中文音调和中断指数(C-TOBI)表示的韵律生成架构。 Tobi是一种基于语言知识的多层表示系统,以在话语中转录事件。已知采用TOBI作为中间表示的TTS系统,与直接韵律产生TTS系统相比表现出更高的灵活性,模块化和域/任务可移植性。我们将中国硕士发电作为分类问题,并将条件最大熵(ME)分类应用于此问题。我们经验验证了各种自然语言和语音学功能的有用性,为我提供良好的综合功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号