Residual-Based Excitation with Continuous FO Modeling in HMM-Based Speech Synthesis

机译：基于HMM的语音合成中基于残差的连续FO建模激励

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In statistical parametric speech synthesis, creaky voice can cause disturbing artifacts. The reason is that standard pitch tracking algorithms tend to erroneously measure FO in regions of creaky voice. This pattern is learned during training of hidden Markov-models (HMMs). In the synthesis phase, false voiced/unvoiced decision caused by creaky voice results in audible quality degradation. In order to eliminate this phenomena, we use a simple continuous FO tracker which does not apply a strict voiced/unvoiced decision. In the proposed residual-based vocoder, Maximum Voiced Frequency is used for mixed voiced and unvoiced excitation. As all parameters of the vocoder are continuous, Multi-Space Distribution is not necessary during training the HMMs, which has been shown to be advantageous. Artifacts caused by creaky voice are eliminated with this speech synthesis system. A subjective listening test of English utterances has shown improvement over the traditional excitation.

机译：在统计参数语音合成中，吱吱作响的语音会引起令人不快的伪影。原因是标准的音调跟踪算法往往会错误地测量发声嘶哑的区域中的FO。这种模式是在训练隐马尔可夫模型（HMM）期间学习的。在合成阶段，由于声音嘎吱作响而导致的错误的浊音/清音决策会导致可听质量下降。为了消除这种现象，我们使用了一个简单的连续FO跟踪器，该跟踪器没有应用严格的浊音/清音决定。在提出的基于残差的声码器中，最大浊音频率用于混合浊音和非浊音激励。由于声码器的所有参数都是连续的，因此在训练HMM时无需进行多空间分配，这已证明是有利的。用这种语音合成系统消除了由吱吱作响的声音引起的伪影。主观的英语话语听力测试显示，与传统的激发相比有所改善。

著录项

来源
《International conference on statistical language and speech processing》|2015年|27-38|共12页
会议地点
作者
Tamas Gabor Csapo; Geza Nemeth; Milos Cernak;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speech synthesis; HMM; Creaky voice; Vocoder; Pitch tracking;

机译：语音合成; HMM;声音嘶哑;声码器音高跟踪;

相似文献

外文文献
中文文献
专利

1. Statistical Approaches to Excitation Modeling in HMM-Based Speech Synthesis [J] . June Sig SUNG, Doo Hwa HONG, Hyun Woo KOO, IEICE transactions on information and systems . 2013,第2期

机译：基于HMM的语音合成中激励建模的统计方法
2. Statistical Approaches to Excitation Modeling in HMM-Based Speech Synthesis [J] . June Sig SUNG, Doo Hwa HONG, Hyun Woo KOO, IEICE Transactions on Information and Systems . 2013,第2期

机译：基于HMM的语音合成中激励建模的统计方法
3. Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis [J] . Zhengqi Wen, Jianhua Tao, Shifeng Pan, Journal of VLSI signal processing systems for signal, image, and video technology . 2014,第3期

机译：基于音高的频谱激励基于HMM的语音合成模型
4. Residual-Based Excitation with Continuous FO Modeling in HMM-Based Speech Synthesis [C] . Tamas Gabor Csapo, Geza Nemeth, Milos Cernak International Conference on Statistical Language and Speech Processing . 2015

机译：基于HMM的语音合成中连续FO模型的残余励磁
5. HMM-based non-intrusive speech quality and implementation of Viterbi score distribution and hiddenness based measures to improve the performance of speech recognition [D] . Talwar, Gaurav 2006

机译：基于HMM的非侵入式语音质量以及基于Viterbi分数分布和隐蔽性的措施的实施，以提高语音识别的性能
6. Hybrid Continuous Density Hmm-Based Ensemble Neural Networks for Sensor Fault Detection and Classification in Wireless Sensor Network [O] . Malathy Emperuman, Srimathi Chandrasekaran 2020

机译：基于混合连续密度基于Hmm的集成神经网络用于无线传感器网络中的传感器故障检测和分类
7. An Excitation Model for HMM-Based Speech Synthesis Based on Residual Modeling [O] . Ranniery Maia, Tomoki Toda, Heiga Zen, 2007

机译：基于残留建模的基于HMM的语音合成激励模型

Residual-Based Excitation with Continuous FO Modeling in HMM-Based Speech Synthesis

摘要

著录项

相似文献

相关主题

期刊订阅