首页> 外文会议>INTERSPEECH 2012 >Amplitude Spectrum based Excitation Model for HMM-based Speech Synthesis

【24h】

Amplitude Spectrum based Excitation Model for HMM-based Speech Synthesis

机译：基于HMM的语音合成的基于幅度谱的激励模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes an excitation model based on amplitude spectrum for hidden Markov model (HMM)-based speech synthesis system (HTS). Residual signal obtained from inverse filtering is decomposed into periodic and aperiodic spectrums in frequency domain. Amplitude spectrum of half pitch period length is reserved as periodic component in synthesis stage and zero-phase criterion and pitch synchronous overlap add method (PSOLA) are adopted to reconstruct the residual signal. Before integrating this excitation model into HTS, these periodic spectrums are normalized and Linde-Buzo-Gray (LBG) algorithm is adopted to construct codebooks for every Mandarin final1. Then index parameters from these codebooks which, are indicated as excitation information are taken into HTS training together with spectral, FO and aperiodic parameters. Listening test showed that for female voice the analysis-synthesis result of the vocoder based on proposed excitation model is comparable with that of STRAIGHT and when integrating into HTS, the quality of generated speech is also improved.

机译：本文介绍了基于隐马尔可夫模型（HMM）的语音合成系统（HTS）的幅度谱的激励模型。从逆滤波获得的残留信号被分解成频域中的周期性和非周期性频谱。半间距周期长度的幅度谱被保留为合成阶段中的周期性分量，采用零相标准和俯仰同步重叠添加方法（PSOLA）来重建残差信号。在将该激励模型集成到HTS之前，这些定期频谱是归一化的，并且采用LINDE-Buzo-灰（LBG）算法为每个普通话1构建码本。然后，从这些码本的索引参数被指示为激励信息，与光谱，FO和非周期性参数一起培训。聆听测试表明，对于女性语音，基于所提出的激励模型的Vocoder的分析合成结果与直接和将其集成到HTS时，产生的语音的质量也得到了改善。

著录项

来源
《INTERSPEECH 2012》|2012年||共4页
会议地点
作者
Zhengqi Wen; Jianhua Tao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 73.4136083;
关键词
speech synthesis; HMM-based speech synthesis; excitation model; amplitude spectrum;

机译：语音合成;基于HMM的语音合成;励磁模型;幅度谱;
入库时间 2022-08-20 22:09:17

相似文献

外文文献
中文文献
专利

1. Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis [J] . Zhengqi Wen, Jianhua Tao, Shifeng Pan, Journal of VLSI signal processing systems for signal, image, and video technology . 2014,第3期

机译：基于音高的频谱激励基于HMM的语音合成模型
2. Statistical Approaches to Excitation Modeling in HMM-Based Speech Synthesis [J] . June Sig SUNG, Doo Hwa HONG, Hyun Woo KOO, IEICE transactions on information and systems . 2013,第2期

机译：基于HMM的语音合成中激励建模的统计方法
3. Statistical Approaches to Excitation Modeling in HMM-Based Speech Synthesis [J] . June Sig SUNG, Doo Hwa HONG, Hyun Woo KOO, IEICE Transactions on Information and Systems . 2013,第2期

机译：基于HMM的语音合成中激励建模的统计方法
4. Amplitude Spectrum based Excitation Model for HMM-based Speech Synthesis [C] . Zhengqi Wen, Jianhua Tao Annual conference of the International Speech Communication Association . 2012

机译：基于幅度谱的基于HMM的语音合成激励模型
5. Analysis and synthesis on speech based on an human auditory modeling [D] . Lee, Minkyu. 1996

机译：基于人类听觉建模的言语分析与综合
6. Neural Spike-Train Analyses of the Speech-Based Envelope Power Spectrum Model [O] . Varsha H. Rallapalli, Michael G. Heinz 2016

机译：基于语音的包络功率谱模型的神经峰值训练分析
7. An Excitation Model for HMM-Based Speech Synthesis Based on Residual Modeling [O] . Ranniery Maia, Tomoki Toda, Heiga Zen, 2007

机译：基于残留建模的基于HMM的语音合成激励模型

Amplitude Spectrum based Excitation Model for HMM-based Speech Synthesis

摘要

著录项

相似文献

相关主题

期刊订阅