Parametric speech codec for representing synthetic speech in the presence of background noise

Abstract

A system and method are provided for processing audio and speech signals using a pitch- and voicing-dependent spectral estimation algorithm (the voicing algorithm). With a single model, the algorithm accurately represents voiced speech, unvoiced speech, mixed speech in the presence of background noise, and background noise itself. The present invention also modifies the synthesis model based on an estimate of the current input signal, which improves the perceptual quality of the speech and background noise under a variety of input conditions. In addition, the present invention improves the robustness of the voicing-dependent spectral estimation algorithm by introducing a multi-layer neural network into the estimation process. The voicing-dependent spectral estimation algorithm provides an accurate and robust estimate of the voicing probability under a variety of background noise conditions, which is essential to providing high-quality, intelligible speech in the presence of background noise.
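The abstract does not specify the network's inputs, topology, or training, so the following is only a minimal sketch of how a multi-layer neural network could map per-frame features to a voicing probability between 0 and 1. The function name `mlp_voicing_probability`, the two example features, the single hidden layer, and all weight values are hypothetical and chosen purely for illustration; they are not taken from the patent.

```python
import math


def sigmoid(x):
    """Logistic function; maps any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def mlp_voicing_probability(features, hidden_weights, hidden_biases,
                            output_weights, output_bias):
    """Forward pass of a one-hidden-layer network (illustrative only).

    features       : list of per-frame input features (hypothetical choice)
    hidden_weights : one weight row per hidden unit
    hidden_biases  : one bias per hidden unit
    output_weights : one weight per hidden unit
    output_bias    : scalar bias of the output unit
    Returns a voicing probability in (0, 1).
    """
    hidden = [
        math.tanh(sum(w * f for w, f in zip(row, features)) + b)
        for row, b in zip(hidden_weights, hidden_biases)
    ]
    activation = sum(w * h for w, h in zip(output_weights, hidden)) + output_bias
    return sigmoid(activation)


# Hypothetical, untrained weights; a real system would learn these from data.
HIDDEN_WEIGHTS = [[1.5, -0.5], [-1.0, 2.0]]
HIDDEN_BIASES = [0.0, 0.1]
OUTPUT_WEIGHTS = [2.0, -1.0]
OUTPUT_BIAS = 0.0

# Hypothetical features, e.g. a normalized autocorrelation peak and a
# spectral flatness measure for one analysis frame.
frame_features = [0.9, 0.2]
voicing_probability = mlp_voicing_probability(
    frame_features, HIDDEN_WEIGHTS, HIDDEN_BIASES, OUTPUT_WEIGHTS, OUTPUT_BIAS
)
print(voicing_probability)
```

The sigmoid output keeps the estimate strictly between 0 and 1, so it can be read as a probability. How the codec then uses that value in spectral estimation and synthesis is not described in the abstract and is left out of the sketch.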
