Developments in Corpus-Based Speech Synthesis: Approaching Natural Conversational Speech

Nick CAMPBELL

首页> 外文期刊>IEICE Transactions on Information and Systems >Developments in Corpus-Based Speech Synthesis: Approaching Natural Conversational Speech

【24h】

Developments in Corpus-Based Speech Synthesis: Approaching Natural Conversational Speech

机译：基于语料库的语音合成的发展：接近自然会话语音

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper describes the special demands of conversational speech in the context of corpus-based speech synthesis. The author proposed the CHATR system of prosody-based unit-selection for concate-native waveform synthesis seven years ago, and now extends this work to incorporate the results of an analysis of five-years of recordings of spontaneous conversational speeech in a wide range of actual daily-life situations. The paper proposes that the expresion of affect (often translated as 'kansei' in Japanese) is the main factor differentiating laboratory speech from real-world conversational speech, and presents a framework for the specification of affect through differences in speaking style and voice quality. Having an enormous corpus of speech samples available for concatenation allows the selection of complete phrase-sized utterance segments, and changes the focus of unit selection from segmental or phonetic continuity to one of prosodic and discoursal appropriateness instead.

机译：本文介绍了基于语料库的语音合成中对话语音的特殊要求。作者在七年前提出了基于韵律的单元选择的CHATR系统，用于级联波形合成，现在将这项工作扩展到合并自发性会话语音五年记录的分析结果，包括实际的日常生活情况。本文提出情感的表达（通常在日语中被翻译为“ kansei”）是区分实验室语音和真实对话语音的主要因素，并提出了通过语音风格和语音质量差异来规范情感的框架。具有大量可用于连接的语音样本的语料库可以选择完整的短语大小的发声段，并将单元选择的重点从分段或语音连续性改为韵律和劝阻性适当性之一。

著录项

来源
《IEICE Transactions on Information and Systems》 |2005年第3期|p.376-383|共8页
作者
Nick CAMPBELL;
展开▼
作者单位

Department of Emergent Communication of the ATR Network Informatics Laboratories, Kyoto-fu, 619-0288 Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
speech synthesis; corpora; concatenation; paralinguistic information; communication; affect;

机译：语音合成;语料库;级联;副语言信息;沟通;影响;

相似文献

外文文献
中文文献
专利

1. A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems [J] . Francisco Campillo Díaz, Eduardo Rodríguez Banga Speech Communication . 2006,第8期

机译：基于语料库的语音合成系统中语调建模与语音单元选择相结合的方法
2. Text-to-Speech Synthesis Technology Using Corpus-Based Approach [J] . Hideyuki Mizuno, Hisako Asano, Mitsuaki Isogai, NTT Technical Review . 2004,第3期

机译：基于语料库的文本语音合成技术
3. A Corpus-Based Approach to Speech Enhancement From Nonstationary Noise [J] . Ming J.Srinivasan R.Crookes D. Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第4期

机译：基于语料库的非平稳噪声语音增强方法
4. Annotating conversational speech for corpus-based dialogue speech synthesizer — A first step [C] . Mori Hiroki, Hitomi Takatsugu 2012 International Conference on Speech Database and Assessments. . 2012

机译：为基于语料库的对话语音合成器注释对话语音-第一步
5. Conversational Speech Understanding in Highly Naturalistic Audio Streams [D] . Kaushik, Lakshmish. 2018

机译：高度自然主义的音频流中的会话语音理解
6. The development of lingual gestures in speech: experimental approach to language development [O] . Lucie Ménard, Aude Noiray -1

机译：语言的手势在言语发展：实验方法语言发展
7. Corpus-based unit selection for natural-sounding speech synthesis [O] . Yi Jon Rong-Wei 1975- 2003

机译：基于语料库的单元选择，用于自然发声的语音合成

Developments in Corpus-Based Speech Synthesis: Approaching Natural Conversational Speech

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅