【24h】

Quality Improvement of Psola Analysis-Synthesis Using Partial Zero-Phase Conversion

机译:部分零相转换提高Psola分析合成的质量

获取原文

摘要

This paper discusses two issues of the quality improvement of F0 modified speech based upon PSOLA analysis-synthesis. Previous studies [1][2] pointed out that the location of a window of PSOLA influences the quality of synthesized speech and one of them claimed that the center of a window should be located at a pitch pulse in source waveforms. However, pitch pulse detection sometimes fails due to undesired acoustic evnets. In this paper, several methods are experimetnally examined to reduce pitch pulse detection errors. Even when the detection is done correctly, F0 modified re-synthesized speech sometimes causes "echoes" in the re-arranged waveforms. This is mainly caused by a pitch pulse with small sharpness or by that with two relatively high pulses, not pitch pulses, before and after it. To suppress the echoes with little loss of naturalness, partial zero/#pi#-phase conversion is proposed here. Experimetns show the high validity of the proposed methods in improving the quality of re-synthesized speech.
机译:本文讨论了基于PSOLA分析-合成的F0语音质量改进的两个问题。先前的研究[1] [2]指出,PSOLA窗口的位置会影响合成语音的质量,其中之一声称窗口的中心应位于源波形中的基音脉冲处。但是,由于不希望的声波效应,音调脉冲检测有时会失败。在本文中,实验性地研究了几种减少音调脉冲检测误差的方法。即使正确完成检测,F0修改的重新合成语音有时也会在重新排列的波形中引起“回声”。这主要是由于尖锐度较小的音调脉冲,或者是由其前后的两个相对较高的脉冲而不是音调脉冲引起的。为了抑制回声而损失的自然度很小,这里提出了部分零/#pi#相转换。实验表明,所提出的方法在提高重新合成语音的质量方面具有很高的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号