首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Excitation modeling for HMM-based speech synthesis: Breaking down the impact of periodic and aperiodic components
【24h】

Excitation modeling for HMM-based speech synthesis: Breaking down the impact of periodic and aperiodic components

机译:基于HMM的语音合成的激励模型:分解周期性和非周期性成分的影响

获取原文

摘要

HMM-based speech synthesis generally suffers from typical buzzi-ness due to over-simplified excitation modeling of voiced speech. In order to alleviate this effect, several studies have proposed various new excitation models. No consensus has however been reached on what is the perceptual importance of the accurate modeling of the periodic and aperiodic components of voiced speech, and to what extent they separately contribute in improving naturalness. This paper considers a generalized mixed excitation modeling, common to various existing approaches, in which both periodic and aperiodic components coexist. At least three main factors may alter the quality of synthesis: periodic waveform, noise spectral weighting, and noise time envelope. Based on a large subjective evaluation, the goal of this paper is threefold: i) to evaluate the relative perceptual importance of each factor, ii) to investigate what is the most appropriate method to model the periodic and aperiodic components, and iii) to provide prospective clues for future work in excitation modeling.
机译:基于HMM的语音合成通常会因语音语音的激励建模过于简单而遭受典型的嗡嗡声。为了减轻这种影响,一些研究提出了各种新的激励模型。然而,关于语音的周期性和非周期性成分的精确建模在感知上的重要性以及它们分别在多大程度上有助于改善自然性,尚未达成共识。本文考虑了各种现有方法通用的广义混合激励模型,其中周期和非周期分量都共存。至少三个主要因素可能会改变合成质量:周期波形,噪声频谱加权和噪声时间包络。基于大量的主观评估,本文的目标是三个方面:i)评估每个因素的相对感知重要性,ii)研究建模周期和非周期性成分的最合适方法,以及iii)提供激励建模未来工作的潜在线索。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号