首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >Modeling Irregular Voice in Statistical Parametric Speech Synthesis With Residual Codebook Based Excitation
【24h】

Modeling Irregular Voice in Statistical Parametric Speech Synthesis With Residual Codebook Based Excitation

机译:基于残余码本的激励在统计参量语音合成中建模不规则语音

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Statistical parametric text-to-speech synthesis is optimized for regular voices and may not create high-quality output with speakers producing irregular phonation frequently. A number of excitation models have been proposed recently in the hidden Markov-model speech synthesis framework, but few of them deal with the occurrence of this phenomenon. The baseline system of this study is our previous residual codebook based excitation model, which uses frames of pitch-synchronous residuals. To model the irregular voice typically occurring in phrase boundaries or sentence endings, two alternative extensions are proposed. The first, rule-based method applies pitch halving, amplitude scaling of residual periods with random factors and spectral distortion. The second, data-driven approach uses a corpus of residuals extracted from irregularly phonated vowels and unit selection is applied during synthesis. In perception tests of short speech segments, both methods have been found to improve the baseline excitation in preference and similarity to the original speaker. An acoustic experiment has shown that both methods can synthesize irregular voice that is close to original irregular phonation in terms of open quotient. The proposed methods may contribute to building natural, expressive and personalized speech synthesis systems.
机译:统计参数文本到语音的合成针对常规语音进行了优化,并且可能无法在扬声器频繁产生不规则发声的情况下产生高质量的输出。最近在隐马尔可夫模型语音合成框架中提出了许多激励模型,但是很少有模型可以解决这种现象的发生。这项研究的基准系统是我们先前基于残差码本的激励模型,该模型使用音高同步残差帧。为了对通常出现在短语边界或句子结尾中的不规则语音进行建模,提出了两个替代扩展。第一种基于规则的方法适用于音高减半,具有随机因素的剩余周期的幅度缩放和频谱失真。第二种数据驱动方法使用从不规则发声的元音中提取的残差语料库,并在合成过程中应用单元选择。在短语音段的感知测试中,已经发现这两种方法均可以改善基线激励,并且与原始说话者相似。声学实验表明,两种方法都可以合成不规则语音,该语音在开放商数方面接近原始不规则发声。所提出的方法可以有助于构建自然的,表达性的和个性化的语音合成系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号