Integrating global variance of log power spectrum derived from LSPs into MGE training for HMM-based parametric speech synthesis

机译：将基于LSP的对数功率谱的全局方差整合到基于HMM的参数语音合成的MGE训练中

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents a method to improve hidden Markov model (HMM) based parametric speech synthesis by integrating global variance (GV) of log power spectrum (LPS) derived from line spectral pairs (LSPs) into minimum generation error (MGE) model training. In order to alleviate the over-smoothing effect of the generated spectral structures, an LPS-GV based parameter generation method has been proposed. This method improved the naturalness of synthetic speech when LSPs were used as spectral features. However, it increased the complexity of parameter generation at synthesis time significantly. In this paper, we propose a method to integrate the distortions of LPS-GV derived from LSPs into the criterion of MGE model training in order to utilize LPSGV information at training time instead of at synthesis time. The experimental results show that this proposed method can achieve better naturalness of synthetic speech than the conventional MGE model training without loss of efficiency at synthesis time when LSPs are used as spectral features.

机译：本文提出了一种方法，通过将从线谱对（LSP）导出的对数功率谱（LPS）的全局方差（GV）集成到最小生成误差（MGE）模型训练中，来改进基于隐马尔可夫模型（HMM）的参数语音合成。为了减轻所生成频谱结构的过度平滑效应，提出了一种基于LPS-GV的参数生成方法。当LSP被用作频谱特征时，该方法提高了合成语音的自然性。但是，它大大增加了合成时参数生成的复杂性。在本文中，我们提出了一种将来自LSP的LPS-GV的失真整合到MGE模型训练准则中的方法，以便在训练时而不是在合成时利用LPSGV信息。实验结果表明，与传统的MGE模型训练相比，当LSPs被用作频谱特征时，该方法比传统的MGE模型训练具有更好的合成语音自然性。

著录项

来源
《International Symposium on Chinese Spoken Language Processing》|2014年|201-205|共5页
会议地点
作者
Sun Yu-Sheng; Ling Zhen-Hua; Yin Xiang; Dai Li-Rong;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Acoustic distortion; Acoustics; Hidden Markov models; Speech; Speech synthesis; Training; Vectors; Speech synthesis; global variance; hidden Markov model; line spectral pairs; log power spectrum;

机译：声失真;声学;隐马尔可夫模型;语音;语音合成;训练;矢量;语音合成;全局方差;隐马尔可夫模型;线谱对;对数功率谱;

相似文献

外文文献
中文文献
专利

1. Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis [J] . Zhen-Hua Ling, Richmond K., Yamagishi J., Audio, Speech, and Language Processing, IEEE Transactions on . 2009,第6期

机译：将发音特征集成到基于HMM的参数语音合成中
2. Efficient Implementation of Global Variance Compensation for Parametric Speech Synthesis [J] . Takashi Nose Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第10期

机译：参数语音合成的全局方差补偿的有效实现
3. Parameter Generation Considering LSP Ordering Property for HMM-Based Speech Synthesis [J] . Shijun Qian, Huanliang Wang, Wenjiang Pei, Signal Processing Letters, IEEE . 2012,第8期

机译：基于HMM的语音合成中考虑LSP排序特性的参数生成
4. Integrating global variance of log power spectrum derived from LSPs into MGE training for HMM-based parametric speech synthesis [C] . Sun Yu-Sheng, Ling Zhen-Hua, Yin Xiang, International Symposium on Chinese Spoken Language Processing . 2014

机译：将Log Power Spectum的全局方差集成到LSP中的基于HMM的参数语音合成的MGE培训
5. TRAJECTORY TRAINING CONSIDERING GLOBAL VARIANCE FOR HMM-BASED SPEECH SYNTHESIS [O] . Tomoki Toda, Steve Young 2010

机译：考虑基于HMM的语音合成的全局方差的轨迹训练

Integrating global variance of log power spectrum derived from LSPs into MGE training for HMM-based parametric speech synthesis

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅