Venue: IEEE International Conference on Acoustics, Speech and Signal Processing

Breathing and Speech Planning in Spontaneous Speech Synthesis

Abstract

Breathing and speech planning in spontaneous speech are coordinated processes, often exhibiting disfluent patterns. While synthetic speech is not subject to respiratory needs, integrating breath into synthesis has advantages for naturalness and recall. At the same time, a synthetic voice reproducing disfluent breathing patterns learned from the data can be problematic. To address this, we first propose training stochastic TTS on a corpus of overlapping breath-group bigrams, to take context into account. Next, we introduce an unsupervised automatic annotation of likely-disfluent breath events, through a product-of-experts model that combines the output of two breath-event predictors, each using complementary information and operating in opposite directions. This annotation enables creating an automatically-breathing spontaneous speech synthesiser with a more fluent breathing style. A subjective evaluation on two spoken genres (impromptu and rehearsed) found the proposed system to be preferred over the baseline approach treating all breath events the same.
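
The two ideas named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the experts are parameterised or how their outputs are thresholded, so the sketch assumes each predictor emits a Bernoulli probability that a breath occurs at a given position, combines the two as a normalised product, and flags an observed breath as likely disfluent when the combined probability falls below a hypothetical threshold. All function names and the threshold value are illustrative assumptions.

```python
from typing import List, Sequence, Tuple


def overlapping_bigrams(groups: Sequence[str]) -> List[Tuple[str, str]]:
    """Pair each breath group with its successor.

    Consecutive pairs share one breath group, which is what makes the
    bigrams overlap and gives each group some left or right context.
    """
    return [(groups[i], groups[i + 1]) for i in range(len(groups) - 1)]


def product_of_experts(p_a: float, p_b: float) -> float:
    """Normalised product of two Bernoulli experts (assumed form).

    p_a and p_b are each expert's probability that a breath occurs.
    """
    numerator = p_a * p_b
    denominator = numerator + (1.0 - p_a) * (1.0 - p_b)
    if denominator == 0.0:
        # One expert says 0 and the other says 1: no information either way.
        return 0.5
    return numerator / denominator


def flag_likely_disfluent(
    p_forward: Sequence[float],
    p_backward: Sequence[float],
    threshold: float = 0.5,  # hypothetical cut-off, not taken from the paper
) -> List[bool]:
    """Mark observed breaths that the combined experts consider unlikely."""
    return [
        product_of_experts(a, b) < threshold
        for a, b in zip(p_forward, p_backward)
    ]


if __name__ == "__main__":
    print(overlapping_bigrams(["so I went", "to the shop", "and then"]))
    print(flag_likely_disfluent([0.9, 0.2], [0.8, 0.3]))
```

Under these assumptions, a breath that both experts consider probable keeps a high combined probability, while a breath that both consider improbable is pushed towards zero and gets flagged. The product sharpens agreement between the experts, which is the usual motivation for a product-of-experts combination over a simple average.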
