【24h】

Estimation of GMM in voice conversion including unaligned data

机译:语音转换中的GMM估计,包括未对齐的数据

获取原文
获取原文并翻译 | 示例

摘要

Voice conversion consists in transforming a source speaker voice into a target speaker voice. There are many applications of voice conversion systems where the amount of training data from the source speaker and the target speaker is different. Usually, the amount of source data available is large, but it is desired to estimate the transformation with a small amount of target data. Systems based on joint Gaussian Mixture Models (GMM) are well suited to voice conversion, but they can't deal with source data without its corresponding aligned target data. In this paper, two alternatives are studied to incorporate unaligned source data in the estimation of a GMM for a voice conversion task. It is shown that when a limited amount of aligned parameters are available in the training step, to only include data from the source speaker increases the performance of the voice transformation.
机译:语音转换包括将源说话者语音转换为目标说话者语音。语音转换系统有许多应用,其中来自源说话者和目标说话者的训练数据量是不同的。通常,可用的源数据量很大,但是希望用少量的目标数据来估计转换。基于联合高斯混合模型(GMM)的系统非常适合语音转换,但是如果没有对应的对齐目标数据,它们就无法处理源数据。在本文中,研究了两种选择,以将未对齐的源数据合并到语音转换任务的GMM估计中。示出了当在训练步骤中有限数量的对齐参数可用时,仅包括来自源说话者的数据将提高语音转换的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号