首页> 外文会议>INTERSPEECH 2012 >Amplitude Spectrum based Excitation Model for HMM-based Speech Synthesis
【24h】

Amplitude Spectrum based Excitation Model for HMM-based Speech Synthesis

机译:基于HMM的语音合成的基于幅度谱的激励模型

获取原文

摘要

This paper describes an excitation model based on amplitude spectrum for hidden Markov model (HMM)-based speech synthesis system (HTS). Residual signal obtained from inverse filtering is decomposed into periodic and aperiodic spectrums in frequency domain. Amplitude spectrum of half pitch period length is reserved as periodic component in synthesis stage and zero-phase criterion and pitch synchronous overlap add method (PSOLA) are adopted to reconstruct the residual signal. Before integrating this excitation model into HTS, these periodic spectrums are normalized and Linde-Buzo-Gray (LBG) algorithm is adopted to construct codebooks for every Mandarin final1. Then index parameters from these codebooks which, are indicated as excitation information are taken into HTS training together with spectral, FO and aperiodic parameters. Listening test showed that for female voice the analysis-synthesis result of the vocoder based on proposed excitation model is comparable with that of STRAIGHT and when integrating into HTS, the quality of generated speech is also improved.
机译:本文介绍了基于隐马尔可夫模型(HMM)的语音合成系统(HTS)的幅度谱的激励模型。从逆滤波获得的残留信号被分解成频域中的周期性和非周期性频谱。半间距周期长度的幅度谱被保留为合成阶段中的周期性分量,采用零相标准和俯仰同步重叠添加方法(PSOLA)来重建残差信号。在将该激励模型集成到HTS之前,这些定期频谱是归一化的,并且采用LINDE-Buzo-灰(LBG)算法为每个普通话1构建码本。然后,从这些码本的索引参数被指示为激励信息,与光谱,FO和非周期性参数一起培训。聆听测试表明,对于女性语音,基于所提出的激励模型的Vocoder的分析合成结果与直接和将其集成到HTS时,产生的语音的质量也得到了改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号