GMM-based Voice Conversion Applied to Emotional Speech Synthesis

机译：基于GMM的语音转换应用于情绪语音合成

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Voice conversion method is applied to synthesizing emotional speech from standard reading (neutral) speech. Pairs of neutral speech and emotional speech are used for conversion rule training. The conversion adopts GMM (Gaussian Mixture Model) with DFW (Dynamic Frequency Warping). We also adopt STRAIGHT, the high-quality speech analysis-synthesis algorithm. As conversion target emotions, (Hot) anger, (cold) sadness and (hot) happiness are used. The convened speech is evaluated objectively first using mel cepstrum distortion as a criterion. The result confirms the GMM-based voice conversion can reduce distortion between target speech and neutral speech. A subjective test is also carried out to investigate perceptual effect. From the viewpoint of influence of prosody, two kinds of prosody are used to synthesis. One is natural prosody extracted from neutral speech and the other is from emotional speech. The result shows that prosody mainly contribute to emotion and spectrum conversion can reinforce it.

机译：语音转换方法应用于合成标准读数（中性）语音的情绪语音。对中性语音和情感言论对进行转换规则培训。转换采用GMM（高斯混合模型）与DFW（动态频率翘曲）。我们还采用了直接，高质量的言语分析合成算法。随着转化的目标情绪，（热）愤怒，（冷）悲伤和（热）幸福被使用。召开的语音是客观地首先使用MEL Cepstrum失真作为标准评估的。结果证实基于GMM的语音转换可以减少目标语音和中立语音之间的失真。还进行了主观测试以调查感知效果。从韵律的影响的角度来看，两种韵律用于合成。一个是从中立言论提取的自然韵律，另一个是从情绪言论中。结果表明，韵律主要有助于情绪和频谱转换可以加强它。

著录项

来源
《European Conference on Speech Communication and Technology》|2003年||共4页
会议地点
作者
Hiromichi Kawanami; Yohei Iwami; Tomoki Toda; Hiroshi Saruwatari; Kiyohiro Shikano; International Speech Communication Association(ISCA);
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动信息理论;
关键词

相似文献

外文文献
中文文献
专利

1. Emotional speech synthesis using GMM-based voice conversion technique [J] . Yohei Iwami, Tomoki Toda, Hiromichi Kawanami, 電子情報通信学会技術研究報告. 音声. Speech . 2002,第619期

机译：使用基于GMM的语音转换技术进行情感语音合成
2. Emotional speech synthesis using GMM-based voice conversion technique [J] . Yohei Iwami, Tomoki Toda, Hiromichi Kawanami, 電子情報通信学会技術研究報告. 音声. Speech . 2002,第619期

机译：基于GMM的语音转换技术的情绪语音合成
3. A Multi-level GMM-Based Cross-Lingual Voice Conversion Using Language-Specific Mixture Weights for Polyglot Synthesis [J] . Ramani B., Jeeva M. P. Actlin, Vijayalakshmi P., Circuits, systems, and signal processing . 2016,第4期

机译：使用多语言合成的基于特定语言的混合权重的基于多级GMM的跨语言语音转换
4. GMM-based Voice Conversion Applied to Emotional Speech Synthesis [C] . Hiromichi Kawanami, Yohei Iwami, Tomoki Toda, European Conference on Speech Communication and Technology . 2003

机译：基于GMM的语音转换应用于情绪语音合成
5. Speech synthesis algorithms for voice conversion. [D] . Hsiao, Yung-Sheng. 1996

机译：用于语音转换的语音合成算法。
6. Comparing Individuals through the Speech Recognition Test Applied to Regional Live Voice and Recorded Speeches from Paraná State in Five Brazilian Counties [O] . Nicoli Valverde Mafra, Angela Ribas, Claudia Moretti, 2019

机译：通过语音识别测试对个人进行比较该测试适用于巴西五个县的巴拉那州的区域现场语音和录制的语音
7. GMM-Based Voice Conversion Applied to Emotional Speech Synthesis [O] . Kawanami Hiromichi, Iwami Yohei, Toda Tomoki, 2003

机译：基于GMM的语音转换在情感语音合成中的应用

GMM-based Voice Conversion Applied to Emotional Speech Synthesis

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅