Conference: European Conference on Speech Communication and Technology

GMM-based Voice Conversion Applied to Emotional Speech Synthesis

Abstract

A voice conversion method is applied to synthesize emotional speech from standard read (neutral) speech. Pairs of neutral and emotional utterances are used to train the conversion rules. The conversion adopts a GMM (Gaussian Mixture Model) with DFW (Dynamic Frequency Warping). We also adopt STRAIGHT, a high-quality speech analysis-synthesis algorithm. The target emotions for conversion are (hot) anger, (cold) sadness, and (hot) happiness. The converted speech is first evaluated objectively, using mel-cepstrum distortion as the criterion. The result confirms that GMM-based voice conversion can reduce the distortion between the target speech and the neutral speech. A subjective test is also carried out to investigate the perceptual effect. To examine the influence of prosody, two kinds of prosody are used for synthesis: one is natural prosody extracted from neutral speech, and the other is extracted from emotional speech. The result shows that prosody is the main contributor to the perceived emotion and that spectrum conversion can reinforce it.
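The abstract names mel-cepstrum distortion as the objective criterion but does not state the formula used. The sketch below assumes the commonly used definition, MCD = (10 / ln 10) * sqrt(2 * sum_d (c_d - c'_d)^2) in dB, averaged over frames that are already time-aligned, with the 0th (energy) coefficient excluded. The function name and the toy frame values are hypothetical and are not taken from the paper.

```python
import math


def mel_cepstral_distortion(target_frames, other_frames):
    """Mean mel-cepstral distortion in dB between two aligned frame sequences.

    Assumption: both inputs are lists of mel-cepstrum vectors of equal length,
    already time-aligned (e.g. by DTW), and coefficient 0 (energy) is skipped.
    """
    if not target_frames or len(target_frames) != len(other_frames):
        raise ValueError("inputs must be non-empty and have the same number of frames")
    scale = 10.0 / math.log(10.0)
    total = 0.0
    for target, other in zip(target_frames, other_frames):
        # Squared Euclidean distance over coefficients 1..D.
        squared = sum((a - b) ** 2 for a, b in zip(target[1:], other[1:]))
        total += scale * math.sqrt(2.0 * squared)
    return total / len(target_frames)


# Hypothetical toy data: two frames, three coefficients each.
target = [[1.0, 0.50, 0.20], [1.0, 0.40, 0.10]]      # emotional (target) speech
neutral = [[1.0, 0.10, 0.00], [1.0, 0.00, -0.10]]    # unconverted neutral speech
converted = [[1.0, 0.40, 0.15], [1.0, 0.35, 0.05]]   # output of a conversion step

mcd_neutral = mel_cepstral_distortion(target, neutral)
mcd_converted = mel_cepstral_distortion(target, converted)
```

Under this definition, a conversion that moves the spectrum toward the target emotion gives a lower `mcd_converted` than `mcd_neutral`, which is the kind of reduction the objective evaluation looks for.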