【24h】

Voice conversion with pitch alteration using phase vocoder

机译:使用相位声码器进行音高改变的语音转换

获取原文
获取原文并翻译 | 示例

摘要

A novel approach to pitch transposition is presented, which allows to preserve the spectral envelope or to modify it in a voice conversion task. Pitch-shifted output is synthesized directly from the input signal using the phase vocoder, without explicitly estimating the fundamental frequency, which is often prone to errors. Frequency band extension technique is proposed to address the problem of bandwidth reduction when scaling pitch down. Artificial neural network was applied to transform the signal's spectral envelope in the full voice conversion task. Listening tests showed that 55% of listeners preferred the quality of the pitch modification using the phase vocoder over the one offered by a parametric approach.
机译:提出了一种新颖的音高转换方法,该方法可以保留频谱包络或在语音转换任务中对其进行修改。使用相位声码器可直接从输入信号合成音高移位的输出,而无需明确估计通常容易出错的基频。提出了频带扩展技术来解决按比例缩小音调时带宽减少的问题。在完整的语音转换任务中,使用了人工神经网络来转换信号的频谱包络。听力测试表明,有55%的听众更喜欢使用相位声码器而不是参数方法提供的音高修改质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号