Annual Conference of the International Speech Communication Association (INTERSPEECH 2011)

Adaptation of Prosody in Speech Synthesis by Changing Command Values of the Generation Process Model of Fundamental Frequency


Abstract

A method was developed to adapt prosody to a new speaker or style in speech synthesis. It is based on predicting the differences between the target and original speakers/styles and applying them to the original one. Differences in fundamental frequency (F_0) contours are represented within the framework of the generation process model, namely as differences in the command magnitudes/amplitudes. While the original speaker/style requires a certain amount of training corpus, the corpus for training the command differences can be small. Furthermore, in the case of style adaptation, that corpus need not be uttered by the same speaker as the original style. Speech was synthesized using an HMM-based speech synthesis system, with prosody controlled by the method. Listening experiments on synthetic speech for both style adaptation and voice conversion showed the validity of the method.
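
The idea of shifting command values can be illustrated with a minimal sketch. It assumes the standard formulation of the generation process model (often called the Fujisaki model), in which ln F_0 is a baseline plus phrase and accent components. The time constants (ALPHA, BETA, GAMMA), the function names, and the hard-coded difference values are illustrative assumptions, not taken from the paper. The abstract does not say how the differences are predicted, so they are simply given as inputs here, and command timings are left unchanged.

```python
import math

# Conventional constants for the model (assumed here, not from the paper).
ALPHA = 3.0   # natural angular frequency of the phrase control mechanism (1/s)
BETA = 20.0   # natural angular frequency of the accent control mechanism (1/s)
GAMMA = 0.9   # ceiling of the accent component


def phrase_response(t, alpha=ALPHA):
    """Impulse response of the phrase control mechanism."""
    return alpha * alpha * t * math.exp(-alpha * t) if t >= 0 else 0.0


def accent_response(t, beta=BETA, gamma=GAMMA):
    """Step response of the accent control mechanism, clipped at gamma."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)


def log_f0(t, base_f0, phrases, accents):
    """ln F0 at time t.

    phrases: list of (onset_time, magnitude)
    accents: list of (onset_time, offset_time, amplitude)
    """
    value = math.log(base_f0)
    for t0, magnitude in phrases:
        value += magnitude * phrase_response(t - t0)
    for t1, t2, amplitude in accents:
        value += amplitude * (accent_response(t - t1) - accent_response(t - t2))
    return value


def adapt_commands(phrases, accents, phrase_deltas, accent_deltas):
    """Add per-command differences to the original magnitudes/amplitudes.

    The deltas stand in for the output of a difference predictor, which is
    hypothetical here; command timings are kept as they are.
    """
    new_phrases = [(t0, m + d) for (t0, m), d in zip(phrases, phrase_deltas)]
    new_accents = [(t1, t2, a + d)
                   for (t1, t2, a), d in zip(accents, accent_deltas)]
    return new_phrases, new_accents


if __name__ == "__main__":
    # Made-up commands for one short utterance of the original speaker/style.
    phrases = [(0.0, 0.5)]
    accents = [(0.3, 0.6, 0.4)]
    # Made-up differences toward the target speaker/style.
    tgt_phrases, tgt_accents = adapt_commands(phrases, accents, [0.1], [0.2])
    for t in (0.2, 0.5, 0.8):
        original = math.exp(log_f0(t, 120.0, phrases, accents))
        adapted = math.exp(log_f0(t, 120.0, tgt_phrases, tgt_accents))
        print(f"t={t:.1f}s  original={original:.1f} Hz  adapted={adapted:.1f} Hz")
```

Because the adaptation touches only a handful of command values per utterance rather than the whole F_0 contour, a small corpus may plausibly suffice to learn the differences, which is consistent with the claim in the abstract.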
