首页> 外文期刊>IEEE transactions on audio, speech and language processing >Noise compensation for speech recognition with arbitrary additive noise
【24h】

Noise compensation for speech recognition with arbitrary additive noise

机译:具有任意加性噪声的语音识别噪声补偿

获取原文
获取原文并翻译 | 示例
           

摘要

This paper investigates speech recognition involving additive background noise, assuming no knowledge about the noise characteristics. A new method, namely universal compensation (UC), is proposed as a solution to the problem. The UC method is an extension of the missing-feature method, i.e., recognition based only on reliable data but robust to any corruption type, including full corruption in which the noise affects all time-frequency components of the speech representation. The UC technique achieves robustness to unknown, full noise corruption through a novel combination of the multicondition training method and the missing-feature method. Multicondition training is employed to convert fullband spectral corruption into partial-band spectral corruption, which is achieved by training the model using data involving simulated wide-band noise at different signal-to-noise ratios. The missing-feature principle is employed to reduce the effect of the remaining partial-band corruption on recognition by basing the recognition only on the matched or compensated spectral components from the multicondition training. The combination of these two strategies makes the new method potentially capable of dealing with arbitrary additive noise-with arbitrary temporal-spectral characteristics-based only on clean speech training data and simulated noise data, without requiring knowledge of the actual noise. Two databases, Aurora 2 and an E-set word database, have been used to evaluate the UC method. Experiments on Aurora 2 indicate that the new model has the potential to achieve a recognition performance close to the performance obtained by a multicondition baseline model trained using data involving the test environments. Further experiments for noise conditions unseen in Aurora 2 show significant performance improvement for the new model over the multicondition model. The experimental results on the E-set database demonstrate the ability of the UC model to deal with acoustically confusing recognition tasks.
机译:本文在不了解噪声特征的前提下,研究涉及加性背景噪声的语音识别。提出了一种新的方法,即通用补偿(UC),以解决该问题。 UC方法是缺失功能方法的扩展,即仅基于可靠的数据进行识别,但对任何损坏类型均具有鲁棒性,包括完全损坏,其中噪声会影响语音表示的所有时频分量。 UC技术通过多条件训练方法和缺失特征方法的新颖结合,实现了对未知,完全噪声破坏的鲁棒性。采用多条件训练将全频带频谱损坏转换为部分频带频谱损坏,这是通过使用涉及模拟信噪比的模拟宽带噪声的数据训练模型来实现的。通过仅基于多条件训练中匹配或补偿的频谱成分进行识别,采用缺失特征原理来减少剩余的部分频带损坏对识别的影响。这两种策略的结合使这种新方法有可能能够仅基于纯净语音训练数据和模拟噪声数据处理具有任意时间频谱特性的任意附加噪声,而无需了解实际噪声。已使用两个数据库Aurora 2和一个E-set单词数据库来评估UC方法。在Aurora 2上进行的实验表明,新模型有可能获得与使用涉及测试环境的数据训练的多条件基线模型所获得的性能接近的识别性能。在Aurora 2中看不到的噪声条件的进一步实验表明,与多条件模型相比,新模型的性能有了显着提高。在E-set数据库上的实验结果证明了UC模型处理声音混乱的识别任务的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号