Modeling Irregular Voice in Statistical Parametric Speech Synthesis With Residual Codebook Based Excitation

Abstract

Statistical parametric text-to-speech synthesis is optimized for regular voices and may not create high-quality output with speakers producing irregular phonation frequently. A number of excitation models have been proposed recently in the hidden Markov-model speech synthesis framework, but few of them deal with the occurrence of this phenomenon. The baseline system of this study is our previous residual codebook based excitation model, which uses frames of pitch-synchronous residuals. To model the irregular voice typically occurring in phrase boundaries or sentence endings, two alternative extensions are proposed. The first, rule-based method applies pitch halving, amplitude scaling of residual periods with random factors and spectral distortion. The second, data-driven approach uses a corpus of residuals extracted from irregularly phonated vowels and unit selection is applied during synthesis. In perception tests of short speech segments, both methods have been found to improve the baseline excitation in preference and similarity to the original speaker. An acoustic experiment has shown that both methods can synthesize irregular voice that is close to original irregular phonation in terms of open quotient. The proposed methods may contribute to building natural, expressive and personalized speech synthesis systems.
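The rule-based extension described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `irregularize`, the scaling range, and the choice to realize pitch halving by silencing every second pitch-synchronous residual period are all assumptions made for illustration, and the spectral distortion step is omitted.

```python
import numpy as np

def irregularize(periods, scale_range=(0.3, 1.0), seed=0):
    """Toy rule-based transform of pitch-synchronous residual periods.

    `periods` is a list of 1-D arrays, one per pitch period.
    `scale_range` and `seed` are hypothetical parameters, not from the paper.
    """
    rng = np.random.default_rng(seed)
    out = []
    for i, period in enumerate(periods):
        period = np.asarray(period, dtype=float)
        if i % 2 == 1:
            # Pitch halving (assumed realization): silence every second
            # period so excitation pulses arrive at half the original rate.
            out.append(np.zeros_like(period))
        else:
            # Amplitude scaling with a random factor per remaining period.
            factor = rng.uniform(*scale_range)
            out.append(factor * period)
    return np.concatenate(out) if out else np.zeros(0)

# Usage: four identical dummy periods of four samples each.
periods = [np.ones(4) for _ in range(4)]
excitation = irregularize(periods)
```

In this sketch the total duration of the excitation is unchanged, while the surviving periods vary in amplitude from one to the next, which is one simple way to approximate the irregular pulse pattern the abstract refers to.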

Bibliographic details

Source
IEEE Journal of Selected Topics in Signal Processing | 2014, no. 2 | pp. 209-220 | 12 pages
Original format: PDF
Language: English
Keywords
Creaky voice; HMM; excitation; glottalization; irregular phonation; parametric; residual; speech processing; speech synthesis; vocal fry; voice quality;

Similar literature

1. Statistical parametric speech synthesis with a novel codebook-based excitation model [J]. Tamas Gabor Csapo, Geza Nemeth. Intelligent Decision Technologies, 2014, no. 4
2. Acoustic Modeling Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis and Voice Conversion [J]. Zhen-Hua Ling, Ling-Hui Chen, Li-Rong Dai. 電子情報通信学会技術研究報告. 音声. Speech, 2013, no. 366
3. Excitation modelling using epoch features for statistical parametric speech synthesis [J]. M Kiran Reddy, K Sreenivasa Rao. Computer Speech and Language, 2020, Mar. issue
4. Modeling unvoiced sounds in statistical parametric speech synthesis with a continuous vocoder [C]. Tamás Gábor Csapó, Géza Németh, Milos Cernak. European Signal Processing Conference, 2016
5. Speech statistical modelling and its applications in voice activity detector and speech enhancement [D]. Zhang, Wei. 2002
6. Discriminative Multi-Stream Postfilters Based on Deep Learning for Enhancing Statistical Parametric Speech Synthesis [O]. Marvin Coto-Jiménez. 2021
7. Voice Source Modelling Using Deep Neural Networks for Statistical Parametric Speech Synthesis [O]. Alku Paavo, Kane John, King Simon. 2014
