Journal: IEICE Transactions on Information and Systems

Developments in Corpus-Based Speech Synthesis: Approaching Natural Conversational Speech

Abstract

This paper describes the special demands of conversational speech in the context of corpus-based speech synthesis. The author proposed the CHATR system of prosody-based unit selection for concatenative waveform synthesis seven years ago, and now extends this work to incorporate the results of an analysis of five years of recordings of spontaneous conversational speech in a wide range of actual daily-life situations. The paper proposes that the expression of affect (often translated as 'kansei' in Japanese) is the main factor differentiating laboratory speech from real-world conversational speech, and presents a framework for the specification of affect through differences in speaking style and voice quality. Having an enormous corpus of speech samples available for concatenation allows the selection of complete phrase-sized utterance segments, and shifts the focus of unit selection from segmental or phonetic continuity to prosodic and discoursal appropriateness.