首页> 外文会议>Advances in computer science and information technology >Chinese Prosody Generation Based on C-ToBI Representation for Text-To-Speech
【24h】

Chinese Prosody Generation Based on C-ToBI Representation for Text-To-Speech

机译:基于C-ToBI表示的文本语音转换中文韵律生成

获取原文
获取原文并翻译 | 示例

摘要

Prosody modeling is critical in developing text-to-speech (TTS) systems where speech synthesis is used to automatically generate natural speech. In this paper, we present a prosody generation architecture based on Chinese Tone and Break Index (C-ToBI) representation. ToBI is a multi-tier representation system based on linguistic knowledge to transcribe events in an utterance. The TTS system which adopts ToBI as an intermediate representation is known to exhibit higher flexibility, modularity and domain/task portability compared with the direct prosody generation TTS systems. We model Chinese prosody generation as a classification problem and apply conditional Maximum Entropy (ME) classification to this problem. We empirically verify the usefulness of various natural language and phonology features to make well-integrated features for ME framework.
机译:韵律模型对于开发语音转换用于自动生成自然语音的文本语音转换(TTS)系统至关重要。在本文中,我们提出了一种基于中文声调和折断指数(C-ToBI)表示的韵律生成架构。 ToBI是一个基于语言知识的多层表示系统,用于以语音方式记录事件。与直接韵律生成TTS系统相比,采用ToBI作为中间表示的TTS系统具有更高的灵活性,模块化和域/任务可移植性。我们将中文韵律生成建模为一个分类问题,并将条件最大熵(ME)分类应用于此问题。我们凭经验验证了各种自然语言和语音特性对于制作ME框架的良好集成特性的有用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号