Conference: 3rd International Congress on Image and Signal Processing

An algorithm for Chinese Voice conversion based on phonetic Gaussian mixture model

Abstract

This paper proposes a novel algorithm for Chinese voice conversion based on a phonetic Gaussian mixture model. The method converts spectral features separately for each phoneme category, using a phonetic Gaussian mixture model for that category. This prevents the spectral smoothing of the traditional Gaussian mixture model (GMM) and avoids phoneme imbalance between training and test material, which improves voice intelligibility and naturalness. Pitch is modified by manipulating the linear prediction residual, guided by knowledge of the instants of significant excitation, to improve the quality of the synthesized speech. In an objective test of spectral similarity to the target voice, the proposed algorithm improved similarity by 9.31% compared with GMM. In a subjective ABX listening test, the proposed algorithm was preferred over the baseline algorithm by 10.36%, and it improved quality by 29.33% in terms of mean opinion score (MOS).
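The per-category spectral conversion described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the widely used joint-density GMM regression (a posterior-weighted sum of per-component linear mappings), simplifies every covariance to diagonal form, and takes the phoneme category of each frame as given. The function names, the dictionary keys (`weight`, `mu_x`, `mu_y`, `var_x`, `cov_yx`), and the category labels are all hypothetical.

```python
import math


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def convert_frame(x, gmm):
    """Map one source spectral frame x into the target speaker's space.

    gmm is a list of components. Each component is a dict holding the
    mixture weight, source mean mu_x, target mean mu_y, source variances
    var_x, and per-dimension cross-covariances cov_yx (assumed layout).
    """
    # Posterior probability of each component given the source frame,
    # computed in the log domain for numerical stability.
    logs = [
        math.log(c["weight"]) + log_gauss_diag(x, c["mu_x"], c["var_x"])
        for c in gmm
    ]
    peak = max(logs)
    unnorm = [math.exp(value - peak) for value in logs]
    total = sum(unnorm)
    post = [u / total for u in unnorm]

    # Posterior-weighted sum of the per-component linear regressions.
    y = [0.0] * len(x)
    for p, c in zip(post, gmm):
        for d in range(len(x)):
            slope = c["cov_yx"][d] / c["var_x"][d]
            y[d] += p * (c["mu_y"][d] + slope * (x[d] - c["mu_x"][d]))
    return y


def convert_utterance(frames, labels, models):
    """Convert each frame with the GMM trained for its phoneme category."""
    return [convert_frame(x, models[label]) for x, label in zip(frames, labels)]


# Toy parameters, invented purely for illustration.
models = {
    "vowel": [
        {"weight": 1.0, "mu_x": [0.0, 0.0], "mu_y": [1.0, 1.0],
         "var_x": [1.0, 1.0], "cov_yx": [0.5, 0.5]},
    ],
    "nasal": [
        {"weight": 0.5, "mu_x": [-1.0], "mu_y": [-2.0],
         "var_x": [1.0], "cov_yx": [0.0]},
        {"weight": 0.5, "mu_x": [1.0], "mu_y": [2.0],
         "var_x": [1.0], "cov_yx": [0.0]},
    ],
}
converted = convert_utterance([[1.0, 2.0]], ["vowel"], models)
```

Selecting the model by phoneme category means each frame is mapped only by components trained on acoustically similar material, which is the mechanism the abstract credits for avoiding spectral smoothing. A full system would also need full covariance matrices, parameters estimated from aligned source and target frames, and the pitch modification step, none of which is shown here.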