首页> 外文期刊>IEICE Transactions on Information and Systems >Cepstral Amplitude Range Normalization for Noise Robust Speech Recognition
【24h】

Cepstral Amplitude Range Normalization for Noise Robust Speech Recognition

机译:倒谱幅度范围归一化,用于噪声鲁棒语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

This paper describes a noise robustness technique that normalizes the cepstral amplitude range in order to remove the influence of additive noise. Additive noise causes speech feature mismatches between testing and training environments and it degrades recognition accuracy in noisy environments. We presume an approximate model that expresses the influence by changing the amplitude range and the DC component in the log-spectra. According to this model, we propose a cepstral amplitude range normalization (CARN) that normalizes the cepstral distance between maximum and minimum values. It can estimate noise robust features without prior knowledge or adaptation. We evaluated its performance in an isolated word recognition task by using the Noisex92 database. Compared with the combinations of conventional methods, the CARN could improve recognition accuracy under various SNR conditions.
机译:本文介绍了一种噪声鲁棒性技术,该技术可对倒频谱幅度范围进行归一化,以消除附加噪声的影响。加性噪声会导致测试和培训环境之间的语音功能不匹配,并且会在嘈杂的环境中降低识别精度。我们假设一个近似模型,通过改变对数谱中的幅度范围和直流分量来表达影响。根据此模型,我们提出了一个倒频谱幅度范围归一化(CARN),可以对最大值和最小值之间的倒频谱距离进行归一化。它可以估计噪声鲁棒性特征,而无需先验知识或自适应。通过使用Noisex92数据库,我们在孤立的单词识别任务中评估了它的性能。与传统方法的组合相比,CARN可以在各种SNR条件下提高识别精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号