首页> 外文会议>International Conference on Electrical and Information Technologies for Rail Transportation >The Voice Conversion Method Based on Sparse Convolutive Non-negative Matrix Factorization
【24h】

The Voice Conversion Method Based on Sparse Convolutive Non-negative Matrix Factorization

机译:基于稀疏卷积非负矩阵分解的语音转换方法

获取原文

摘要

We propose a voice conversion method based on sparse convolutive non-negative matrix factorization. The method utilizes the Itakura-Saito distance as the objective cost function, making the smaller matrix element with a smaller reconstruction error due to the property of scale invariant of the cost function. The time-frequency basis of the source and target were extracted during the training phase, and the speech is converted through time-frequency basis substitution. The eifect of whisper-to-normal speech conversion experiment is also conducted. Experimental results show that the proposed voice conversion method outperforms the method based on the conventional convolutive non-negative matrix factorization and the method based on the Kullback-Leibler (K-L) cost function in the aspects of speech intelligibility.
机译:我们提出了一种基于稀疏卷积非负矩阵分解的语音转换方法。该方法利用Itakura-Saito距离作为目标成本函数,由于成本函数的尺度不变性,使得较小的矩阵元素具有较小的重构误差。在训练阶段提取源和目标的时频基础,并通过时频基础替换来转换语音。还进行了耳语到正常语音转换实验的效果。实验结果表明,所提出的语音转换方法在语音清晰度方面优于传统的卷积非负矩阵分解方法和基于Kullback-Leibler(K-L)代价函数的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号