首页> 外文会议>IEEE Workshop on Automatic Speech Recognition and Understanding >An investigation of multi-speaker training for wavenet vocoder
【24h】

An investigation of multi-speaker training for wavenet vocoder

机译:Wavenet Vocoder多扬声器训练的调查

获取原文

摘要

In this paper, we investigate the effectiveness of multi-speaker training for WaveNet vocoder. In our previous work, we have demonstrated that our proposed speaker-dependent (SD) WaveNet vocoder, which is trained with a single speaker's speech data, is capable of modeling temporal waveform structure, such as phase information, and makes it possible to generate more naturally sounding synthetic voices compared to conventional high-quality vocoder, STRAIGHT. However, it is still difficult to generate synthetic voices of various speakers using the SD-WaveNet due to its speaker-dependent property. Towards the development of speaker-independent WaveNet vocoder, we apply multi-speaker training techniques to the WaveNet vocoder and investigate its effectiveness. The experimental results demonstrate that 1) the multispeaker WaveNet vocoder still outperforms STRAIGHT in generating known speakers' voices but it is comparable to STRAIGHT in generating unknown speakers' voices, and 2) the multi-speaker training is effective for developing the WaveNet vocoder capable of speech modification.
机译:本文研究了Wavenet Vocoder的多扬声器训练的有效性。在我们以前的工作中,我们已经证明,我们所提出的扬声器依赖(SD)Wavenet声码器,其用单个扬声器的语音数据培训,能够建模时间波形结构,例如相位信息,并使其成为可能产生更多与传统的高品质声码器相比,自然发出合成声音,直。然而,由于其扬声器依赖性,仍然难以使用SD-Wavenet产生各种扬声器的合成声音。为了开发扬声器独立的Wavenet Vocoder,我们将多扬声器训练技术应用于Wavenet Vocoder并调查其有效性。实验结果表明,1)MultiSpeaker Wavenet Vocoder仍然优于发电的扬声器的声音,但它与直接产生未知的扬声器的声音相当,并且2)多扬声器训练对于开发能够的Wavenet Vocoder是有效的语音修改。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号