Journal: IEEE Transactions on Speech and Audio Processing

Applying the harmonic plus noise model in concatenative speech synthesis


Abstract

This paper describes the application of the harmonic plus noise model (HNM) for concatenative text-to-speech (TTS) synthesis. In the context of HNM, speech signals are represented as a time-varying harmonic component plus a modulated noise component. The decomposition of a speech signal into these two components allows for more natural-sounding modifications of the signal (e.g., by using different and better adapted schemes to modify each component). The parametric representation of speech using HNM provides a straightforward way of smoothing discontinuities of acoustic units around concatenation points. Formal listening tests have shown that HNM provides high-quality speech synthesis while outperforming other models for synthesis (e.g., TD-PSOLA) in intelligibility, naturalness, and pleasantness.
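
To make the decomposition concrete, the following is a minimal sketch of one synthesis frame built as a harmonic component plus a modulated noise component. It is not the paper's implementation: the function names, the fixed pitch within a frame, the triangular pitch-synchronous noise envelope, and the default sample rate and frame length are all illustrative assumptions, and the sketch omits any spectral shaping of the noise.

```python
import math
import random


def harmonic_part(f0, amps, phases, sr, n):
    """Sum of sinusoids at integer multiples of f0 (held constant in the frame)."""
    out = []
    for i in range(n):
        t = i / sr
        s = 0.0
        for k, (a, ph) in enumerate(zip(amps, phases), start=1):
            s += a * math.cos(2.0 * math.pi * k * f0 * t + ph)
        out.append(s)
    return out


def modulated_noise_part(f0, gain, sr, n, seed=0):
    """White Gaussian noise shaped by a pitch-synchronous envelope.

    The triangular envelope is an assumption made for illustration only.
    """
    rng = random.Random(seed)
    period = sr / f0  # pitch period in samples
    out = []
    for i in range(n):
        pos = (i % period) / period        # position inside the period, in [0, 1)
        env = 1.0 - abs(2.0 * pos - 1.0)   # 0 at the period edges, 1 at the middle
        out.append(gain * env * rng.gauss(0.0, 1.0))
    return out


def synthesize_frame(f0, amps, phases, noise_gain, sr=16000, n=320, seed=0):
    """Frame = harmonic component + modulated noise component."""
    h = harmonic_part(f0, amps, phases, sr, n)
    e = modulated_noise_part(f0, noise_gain, sr, n, seed)
    return [a + b for a, b in zip(h, e)]


if __name__ == "__main__":
    frame = synthesize_frame(100.0, [1.0, 0.5, 0.25], [0.0, 0.0, 0.0], 0.1)
    print(len(frame), frame[:4])
```

Because the two components are generated separately, each can be modified with its own scheme, for example changing `f0` for the harmonic part only while leaving the noise gain untouched.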
