首页> 外文会议>2012 International Conference on Audio, Language and Image Processing. >STRAIGHT model for voice conversion based on acoustical universal structure
【24h】

STRAIGHT model for voice conversion based on acoustical universal structure

机译:基于声学通用结构的STRAIGHT语音转换模型

获取原文
获取原文并翻译 | 示例

摘要

The existing voice conversion (VC) systems, those based on Gaussian mixture models(GMM), bring the problems of over smoothing of GMM mapping. With an aim towards resolving these problems, this paper provides a method on Acoustical Universal Structure (ASU) that can be applied to voice conversion based on GMM. Our contributions include:1) speech transformation and representation using adaptive interpolation of weighted-spectrum (STRAIGHT) model is taken which allows flexible manipulation of speech parameters such as pitch, vocal tract length, and speaking rate while maintaining high reproduction quality;2) The advantage of the paper is attributed to the introduction of the predictable spectrum, the ASU, in this paper, is introduced to form the mapping relationship between the source speaker and target speaker.3) In the training phase, the feedback strategy is adopted, which guarantee the smooth translation of spectral parameters between frames. Experimental results indicate that the performance of VC can be dramatically improved by the proposed method in view of speech quality, conversion accuracy and naturalness for speaker individuality from the objective and subjective tests.
机译:现有的基于高斯混合模型(GMM)的语音转换(VC)系统带来了GMM映射过度平滑的问题。为了解决这些问题,本文提供了一种基于声学通用结构(ASU)的方法,该方法可应用于基于GMM的语音转换。我们的贡献包括:1)使用加权频谱自适应插值(STRAIGHT)模型进行语音转换和表示,可以在保持高音质的同时灵活地控制语音参数,例如音高,声道长度和讲话率; 2)本文的优势是由于可预测频谱的引入,本文引入了ASU来形成源说话者与目标说话者之间的映射关系。3)在训练阶段,采用了反馈策略,确保帧之间频谱参数的平滑转换。实验结果表明,从语音质量,转换精度和说话人个性的自然性来看,通过主观和客观测试,该方法可以显着提高VC的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号