首页> 外文会议>Annual Conference of the International Speech Communication Association >A Voice Conversion Mapping Function based on a Stacked Joint-Autoencoder
【24h】

A Voice Conversion Mapping Function based on a Stacked Joint-Autoencoder

机译:基于堆叠的联合AutoEncoder的语音转换映射函数

获取原文

摘要

In this study, we propose a novel method for training a regression function and apply it to a voice conversion task. The regression function is constructed using a Stacked Joint-Autoencoder (SJAE). Previously, we have used a more primitive version of this architecture for pre-training a Deep Neural Network (DNN). Using objective evaluation criteria, we show that the lower levels of the SJAE perform best with a low degree of jointness, and higher levels with a higher degree of jointness. We demonstrate that our proposed approach generates features that do not suffer from the averaging effect inherent in back-propagation training. We also carried out subjective listening experiments to evaluate speech quality and speaker similarity. Our results show that the SJAE approach has both higher quality and similarity than a SJAE+DNN approach, where the SJAE is used for pre-training a DNN, and the fine-tuned DNN is then used for mapping. We also present the system description and results of our submission to Voice Conversion Challenge 2016.
机译:在本研究中,我们提出了一种训练回归函数的新方法,并将其应用于语音转换任务。回归函数使用堆叠的联合自动码器(Sjae)构造。以前,我们使用了更原始的这种架构版本,用于预先培训深度神经网络(DNN)。使用客观评估标准,我们表明SJAE的较低水平最佳地以低程度的关联,更高的程度,具有较高程度的关节。我们展示了我们所提出的方法产生不遭受后传播训练中固有的平均效果的特征。我们还开展了主观聆听实验,以评估语音质量和扬声器相似性。我们的结果表明,Sjae方法具有比Sjae + DNN方法更高的质量和相似性,其中Sjae用于预先训练DNN,然后将微调DNN用于映射。我们还提出了我们提交给2016的语音转换挑战的系统描述和结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号