Chinese Prosody Generation Based on C-ToBI Representation for Text-To-Speech

机译：基于C-ToBI表示的文本语音转换中文韵律生成

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Prosody modeling is critical in developing text-to-speech (TTS) systems where speech synthesis is used to automatically generate natural speech. In this paper, we present a prosody generation architecture based on Chinese Tone and Break Index (C-ToBI) representation. ToBI is a multi-tier representation system based on linguistic knowledge to transcribe events in an utterance. The TTS system which adopts ToBI as an intermediate representation is known to exhibit higher flexibility, modularity and domain/task portability compared with the direct prosody generation TTS systems. We model Chinese prosody generation as a classification problem and apply conditional Maximum Entropy (ME) classification to this problem. We empirically verify the usefulness of various natural language and phonology features to make well-integrated features for ME framework.

机译：韵律模型对于开发语音转换用于自动生成自然语音的文本语音转换（TTS）系统至关重要。在本文中，我们提出了一种基于中文声调和折断指数（C-ToBI）表示的韵律生成架构。 ToBI是一个基于语言知识的多层表示系统，用于以语音方式记录事件。与直接韵律生成TTS系统相比，采用ToBI作为中间表示的TTS系统具有更高的灵活性，模块化和域/任务可移植性。我们将中文韵律生成建模为一个分类问题，并将条件最大熵（ME）分类应用于此问题。我们凭经验验证了各种自然语言和语音特性对于制作ME框架的良好集成特性的有用性。

著录项

来源
《Advances in computer science and information technology》|2010年|p.558-571|共14页
会议地点 Miyazaki(JP);Miyazaki(JP);Miyazaki(JP);Miyazaki(JP);Miyazaki(JP);Miyazaki(JP);Miyazaki(JP);Miyazaki(JP)
作者
Byeongchang Kim;
展开▼
作者单位

School of Computer Information Communications Engineering, Catholic University of Daegu, Gyeongbuk, South Korea;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. Prosody Correction Preserving Speaker Individuality for Chinese-Accented Japanese HMM-Based Text-to-Speech Synthesis [J] . Daiki SEKIZAWA, Shinnosuke TAKAMICHI, Hiroshi SARUWATARI IEICE transactions on information and systems . 2019,第6期

机译：基于汉字的日本HMM语音合成中保留韵律校正的说话人个性
2. AUTOMATIC PROSODY GENERATION IN A TEXT-TO-SPEECH SYSTEM FOR HEBREW [J] . Branislav Popovi?, Dragan Kne?evi?, Milan Se?ujski, Facta Universitatis. Series Electronics and Energetics . 2014,第3期

机译：希伯来语文本到语音系统中的自动蛋白质生成
3. High-Quality Prosody Generation in Mandarin Text-to-Speech System [J] . Qing Guo, Jie Zhang, Nobuyuki Katae, Fujitsu Scientific & Technical Journal . 2010,第1期

机译：普通话语音合成系统中的高质量韵律生成
4. Chinese Prosody Generation Based on C-ToBI Representation for Text-To-Speech [C] . Byeongchang Kim International Conference on Advanced Science and Technology . 2010

机译：基于C-Tobi表示的中国韵律发电
5. Model-Based Approach for Product Requirement Representation and Generation in Product Lifecycle Management [D] . Yaman, Omer. 2020

机译：基于模型的产品要求表示和产品生命周期管理中的发电方法
6. Unified Medical Language System resources improve sieve-based generation and Bidirectional Encoder Representations from Transformers (BERT)–based ranking for concept normalization [O] . Dongfang Xu, Manoj Gopale, Jiacheng Zhang, 2020

机译：统一的医疗语言系统资源改善了从变压器（BERT）的基于筛的生成和双向编码器表示为概念标准化排名
7. Automatic prosody generation in a text-to-speech system for Hebrew [O] . Branislav Popovic, Dragan Knezevic, Milan Secujski, 2014

机译：用于希伯来语的文本到语音系统中的自动韵律生成

Chinese Prosody Generation Based on C-ToBI Representation for Text-To-Speech

摘要

著录项

相似文献

相关主题

期刊订阅