首页> 外文会议>International Symposium on Chinese Spoken Language Processing >Integrating global variance of log power spectrum derived from LSPs into MGE training for HMM-based parametric speech synthesis
【24h】

Integrating global variance of log power spectrum derived from LSPs into MGE training for HMM-based parametric speech synthesis

机译:将基于LSP的对数功率谱的全局方差整合到基于HMM的参数语音合成的MGE训练中

获取原文
获取外文期刊封面目录资料

摘要

This paper presents a method to improve hidden Markov model (HMM) based parametric speech synthesis by integrating global variance (GV) of log power spectrum (LPS) derived from line spectral pairs (LSPs) into minimum generation error (MGE) model training. In order to alleviate the over-smoothing effect of the generated spectral structures, an LPS-GV based parameter generation method has been proposed. This method improved the naturalness of synthetic speech when LSPs were used as spectral features. However, it increased the complexity of parameter generation at synthesis time significantly. In this paper, we propose a method to integrate the distortions of LPS-GV derived from LSPs into the criterion of MGE model training in order to utilize LPSGV information at training time instead of at synthesis time. The experimental results show that this proposed method can achieve better naturalness of synthetic speech than the conventional MGE model training without loss of efficiency at synthesis time when LSPs are used as spectral features.
机译:本文提出了一种方法,通过将从线谱对(LSP)导出的对数功率谱(LPS)的全局方差(GV)集成到最小生成误差(MGE)模型训练中,来改进基于隐马尔可夫模型(HMM)的参数语音合成。为了减轻所生成频谱结构的过度平滑效应,提出了一种基于LPS-GV的参数生成方法。当LSP被用作频谱特征时,该方法提高了合成语音的自然性。但是,它大大增加了合成时参数生成的复杂性。在本文中,我们提出了一种将来自LSP的LPS-GV的失真整合到MGE模型训练准则中的方法,以便在训练时而不是在合成时利用LPSGV信息。实验结果表明,与传统的MGE模型训练相比,当LSPs被用作频谱特征时,该方法比传统的MGE模型训练具有更好的合成语音自然性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号