首页> 外文会议>2017 IEEE Automatic Speech Recognition and Understanding Workshop >An investigation of multi-speaker training for wavenet vocoder
【24h】

An investigation of multi-speaker training for wavenet vocoder

机译:波网声码器多说话者训练研究

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we investigate the effectiveness of multi-speaker training for WaveNet vocoder. In our previous work, we have demonstrated that our proposed speaker-dependent (SD) WaveNet vocoder, which is trained with a single speaker's speech data, is capable of modeling temporal waveform structure, such as phase information, and makes it possible to generate more naturally sounding synthetic voices compared to conventional high-quality vocoder, STRAIGHT. However, it is still difficult to generate synthetic voices of various speakers using the SD-WaveNet due to its speaker-dependent property. Towards the development of speaker-independent WaveNet vocoder, we apply multi-speaker training techniques to the WaveNet vocoder and investigate its effectiveness. The experimental results demonstrate that 1) the multispeaker WaveNet vocoder still outperforms STRAIGHT in generating known speakers' voices but it is comparable to STRAIGHT in generating unknown speakers' voices, and 2) the multi-speaker training is effective for developing the WaveNet vocoder capable of speech modification.
机译:在本文中,我们研究了WaveNet声码器多扬声器培训的有效性。在我们之前的工作中,我们已经证明了我们提出的与扬声器相关的(SD)WaveNet声码器,该扬声器受单个扬声器的语音数据训练,能够对时间波形结构(例如相位信息)进行建模,并有可能产生更多的声音。与传统的高质量声码器STRAIGHT相比,听起来自然而然。但是,由于SD-WaveNet具有与扬声器相关的特性,因此仍然难以使用SD-WaveNet生成各种扬声器的合成声音。为了发展与扬声器无关的WaveNet声码器,我们将多扬声器训练技术应用于WaveNet声码器并研究其有效性。实验结果表明:1)多扬声器WaveNet声码器在生成已知扬声器的声音方面仍胜过STRAIGHT,但在生成未知扬声器的声音方面可与STRAIGHT媲美,并且2)多扬声器训练对于开发具有以下功能的WaveNet声码器是有效的:语音修改。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号