Conference: European Signal Processing Conference

CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion


Abstract

Recently, Generative Adversarial Network (GAN)-based methods have shown remarkable performance in voice conversion and WHiSPer-to-normal SPeeCH (WHSP2SPCH) conversion. One of the key challenges in WHSP2SPCH conversion is predicting the fundamental frequency (F0). A state-of-the-art method based on Cycle-Consistent Generative Adversarial Networks (CycleGAN) was recently proposed for WHSP2SPCH conversion. The CycleGAN-based method uses two separate models, one for Mel Cepstral Coefficients (MCC) mapping and another for F0 prediction, and the F0 prediction depends heavily on the pre-trained MCC mapping model. This dependence introduces additional nonlinear noise into the predicted F0. To suppress this noise, we propose the Cycle-in-Cycle GAN (CinC-GAN), which is specifically designed to make F0 prediction more effective without losing accuracy in MCC mapping. We evaluated the proposed method in a non-parallel setting and analyzed it on speaker-specific and gender-specific tasks. Objective and subjective tests show that CinC-GAN significantly outperforms CycleGAN. In addition, we compare CycleGAN and CinC-GAN on unseen speakers, and the results show the clear superiority of CinC-GAN.
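For readers unfamiliar with the cycle-consistency idea that CycleGAN is named after, the following is a minimal sketch of the standard cycle-consistency term only. It is not the CinC-GAN objective, which the abstract does not specify. All names here (`l1_distance`, `cycle_consistency_loss`, `g_w2s`, `g_s2w`) are hypothetical, and the two "generators" are toy scaling functions standing in for trained networks. Because the setting is non-parallel, the whisper frames and speech frames do not need to be aligned or equal in number.

```python
from typing import Callable, List, Sequence

Frame = Sequence[float]
Generator = Callable[[Frame], List[float]]


def l1_distance(a: Frame, b: Frame) -> float:
    """Mean absolute difference between two equal-length feature frames."""
    if len(a) != len(b):
        raise ValueError("frames must have the same dimension")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def cycle_consistency_loss(
    whisper_frames: Sequence[Frame],
    speech_frames: Sequence[Frame],
    g_w2s: Generator,  # hypothetical whisper-to-speech mapping
    g_s2w: Generator,  # hypothetical speech-to-whisper mapping
) -> float:
    """Standard cycle-consistency term: a frame mapped to the other
    domain and back should reconstruct the original frame."""
    # whisper -> speech -> whisper
    forward = [l1_distance(g_s2w(g_w2s(w)), w) for w in whisper_frames]
    # speech -> whisper -> speech
    backward = [l1_distance(g_w2s(g_s2w(s)), s) for s in speech_frames]
    return sum(forward) / len(forward) + sum(backward) / len(backward)


# Toy stand-ins for trained generators (assumption: simple scaling).
def double(frame: Frame) -> List[float]:
    return [2.0 * v for v in frame]


def halve(frame: Frame) -> List[float]:
    return [0.5 * v for v in frame]


def quarter(frame: Frame) -> List[float]:
    return [0.25 * v for v in frame]


whisper = [[1.0, 3.0]]
speech = [[2.0, 2.0]]

# Exact inverses reconstruct every frame, so the loss vanishes.
perfect = cycle_consistency_loss(whisper, speech, double, halve)
# Mismatched mappings leave a reconstruction error in both directions.
imperfect = cycle_consistency_loss(whisper, speech, double, quarter)
```

In this toy setup the loss is zero only when the two mappings invert each other; in an actual system this term would be combined with adversarial losses, and the features would be MCC or F0 sequences rather than hand-written lists.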
