【24h】

Voice conversion with pitch alteration using phase vocoder

机译:使用相位声码器具有音高改变的语音转换

获取原文

摘要

A novel approach to pitch transposition is presented, which allows to preserve the spectral envelope or to modify it in a voice conversion task. Pitch-shifted output is synthesized directly from the input signal using the phase vocoder, without explicitly estimating the fundamental frequency, which is often prone to errors. Frequency band extension technique is proposed to address the problem of bandwidth reduction when scaling pitch down. Artificial neural network was applied to transform the signal's spectral envelope in the full voice conversion task. Listening tests showed that 55% of listeners preferred the quality of the pitch modification using the phase vocoder over the one offered by a parametric approach.
机译:提出了一种新颖的俯仰换位方法,其允许保留光谱包络或在语音转换任务中修改它。使用相位声码器直接从输入信号合成间距移位的输出,而不明确估计基本频率,这通常容易出错。建议频带扩展技术来解决缩放间距时的带宽减少问题。应用人工神经网络以在完整的语音转换任务中转换信号的光谱信封。聆听测试表明,55 %的听众优先使用参数方法提供的相位声码器使用相位声码器的音调修改的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号