Published in: 電子情報通信学会技術研究報告. 音声. Speech (IEICE Technical Report, Speech)

Prosody Reconstruction by Rescaling Fundamental Frequency Contours in Order to Synthesize Communicative Speech


Abstract

This paper presents a prosody reconstruction method for synthesizing conversational speech. A conventional text-to-speech engine first generates reading-style prosody for the input text. A frequency modulation technique then rescales the fundamental frequency (F0) contours to add the communicative functions of intonation to the synthesized speech. The technique is based on a functional F0 model, and the transformation scales are modeled by combining simple piecewise-linear patterns according to input tags. We evaluated the method in two experiments: modulating the F0 range of reading-style prosody in synthesized Japanese speech to convey "good news" and "bad news", and producing narrow focus in synthesized Chinese dialog to convey emphasis. The results showed that the method can use a substantial amount of para-linguistic information to achieve specific communicative purposes.
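
The rescaling idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the functional F0 model, so this sketch assumes a simple stand-in in which the log-F0 excursion above a baseline is multiplied by a time-varying scale. The tag names, the breakpoint values in `PATTERNS`, and the choice to combine patterns by multiplication are all hypothetical.

```python
import numpy as np

# Hypothetical tag -> piecewise-linear scale pattern, given as
# (relative time in [0, 1], scale factor) breakpoints.
# The values are illustrative and are not taken from the paper.
PATTERNS = {
    "good_news": [(0.0, 1.3), (1.0, 1.3)],  # widen the F0 range
    "bad_news": [(0.0, 0.7), (1.0, 0.7)],   # narrow the F0 range
    "focus": [(0.0, 1.0), (0.4, 1.5), (0.6, 1.5), (1.0, 1.0)],  # local boost
}


def scale_curve(tags, n_frames):
    """Interpolate each tag's pattern over the frames and multiply them together.

    Multiplication is an assumed way of "combining" patterns.
    """
    t = np.linspace(0.0, 1.0, n_frames)
    scale = np.ones(n_frames)
    for tag in tags:
        xs, ys = zip(*PATTERNS[tag])
        scale *= np.interp(t, xs, ys)
    return scale


def rescale_f0(f0_hz, tags, baseline_hz=None):
    """Rescale the log-F0 excursion above a baseline.

    Frames with F0 == 0 are treated as unvoiced and left unchanged.
    If no baseline is given, the minimum voiced F0 is used (an assumption).
    """
    f0 = np.asarray(f0_hz, dtype=float)
    out = f0.copy()
    voiced = f0 > 0
    if not voiced.any():
        return out
    if baseline_hz is None:
        baseline_hz = f0[voiced].min()
    scale = scale_curve(tags, len(f0))
    log_excursion = np.log(f0[voiced] / baseline_hz)
    out[voiced] = baseline_hz * np.exp(scale[voiced] * log_excursion)
    return out


# Reading-style contour in Hz; zeros mark unvoiced frames.
contour = [0, 100, 150, 200, 150, 100, 0]
wide = rescale_f0(contour, ["good_news"])
narrow = rescale_f0(contour, ["bad_news"])
focused = rescale_f0(contour, ["focus"])
```

In this sketch the baseline and the unvoiced frames stay fixed, so a scale above 1 raises the peaks (a wider range) and a scale below 1 flattens them, while the `focus` pattern changes the contour only around the middle of the utterance.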