首页> 外文会议>Conference of the International Speech Communication Association >Statistical Synthesizer with Embedded Prosodic and Spectral Modifications to Generate Highly Intelligible Speech in Noise
【24h】

Statistical Synthesizer with Embedded Prosodic and Spectral Modifications to Generate Highly Intelligible Speech in Noise

机译:统计合成器具有嵌入式韵律和光谱修改,以产生高度可理解的噪音语音

获取原文

摘要

This paper describes a statistical parametric speech synthesizer that, despite having been trained on an ordinary synthesis database and without any adaptation data, is able to generate highly intelligible speech in noisy environments. By using a simple and flexible vocoder based on a harmonic model, it applies several noise-independent modifications to durations, pitch level and range, energy contour, formant sharpness, and intensity of particular spectral bands. The system has been evaluated by means of a large subjective test, the results of which show that the suggested approach clearly outperforms the reference TTS systems and even unmodified natural speech in some conditions.
机译:本文介绍了统计参数语音合成器,尽管已经在普通综合数据库上培训并且没有任何适应数据,但能够在嘈杂的环境中产生高度可理解的语音。通过使用基于谐波模型的简单且灵活的声码器,它将多个无关的噪声修改应用于特定光谱带的持续时间,俯仰水平和范围,能量轮廓,形成强度和强度。该系统已经通过大型主观测试评估,结果表明,建议的方法在某些条件下显然优于参考TTS系统,甚至未修改的自然语音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号