Noise compensation for speech recognition with arbitrary additive noise

Ji Ming

Abstract

This paper investigates speech recognition involving additive background noise, assuming no knowledge about the noise characteristics. A new method, namely universal compensation (UC), is proposed as a solution to the problem. The UC method is an extension of the missing-feature method, i.e., recognition based only on reliable data but robust to any corruption type, including full corruption in which the noise affects all time-frequency components of the speech representation. The UC technique achieves robustness to unknown, full noise corruption through a novel combination of the multicondition training method and the missing-feature method. Multicondition training is employed to convert fullband spectral corruption into partial-band spectral corruption, which is achieved by training the model using data involving simulated wide-band noise at different signal-to-noise ratios. The missing-feature principle is employed to reduce the effect of the remaining partial-band corruption on recognition by basing the recognition only on the matched or compensated spectral components from the multicondition training. The combination of these two strategies makes the new method potentially capable of dealing with arbitrary additive noise (with arbitrary temporal-spectral characteristics) based only on clean speech training data and simulated noise data, without requiring knowledge of the actual noise. Two databases, Aurora 2 and an E-set word database, have been used to evaluate the UC method. Experiments on Aurora 2 indicate that the new model has the potential to achieve a recognition performance close to the performance obtained by a multicondition baseline model trained using data involving the test environments. Further experiments for noise conditions unseen in Aurora 2 show significant performance improvement for the new model over the multicondition model. The experimental results on the E-set database demonstrate the ability of the UC model to deal with acoustically confusing recognition tasks.
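
The two ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: all function names are hypothetical, white Gaussian noise stands in for the "simulated wide-band noise", and summing the best-matching subband scores is a deliberately simplified stand-in for the missing-feature principle, not the combination rule used by the UC method.

```python
import math
import random


def add_noise_at_snr(clean, snr_db, rng):
    """Return clean + white Gaussian noise scaled to the requested SNR (dB)."""
    signal_power = sum(x * x for x in clean) / len(clean)
    noise = [rng.gauss(0.0, 1.0) for _ in clean]
    noise_power = sum(n * n for n in noise) / len(noise)
    # Scale the noise so signal_power / scaled_noise_power == 10 ** (snr_db / 10).
    target_noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    gain = math.sqrt(target_noise_power / noise_power)
    return [x + gain * n for x, n in zip(clean, noise)]


def measured_snr_db(clean, noisy):
    """Measure the SNR (dB) of a noisy signal given its clean reference."""
    signal_power = sum(x * x for x in clean)
    noise_power = sum((y - x) ** 2 for x, y in zip(clean, noisy))
    return 10.0 * math.log10(signal_power / noise_power)


def multicondition_set(clean, snrs_db, seed=0):
    """Build one noisy copy of the clean signal per SNR (assumed training recipe)."""
    rng = random.Random(seed)
    return {snr: add_noise_at_snr(clean, snr, rng) for snr in snrs_db}


def best_subband_score(subband_loglikes, keep):
    """Sum the `keep` largest per-subband log-likelihoods.

    Simplified illustration of scoring on reliable components only:
    badly mismatched subbands are left out of the total.
    """
    return sum(sorted(subband_loglikes, reverse=True)[:keep])


# A toy "clean" signal: five cycles of a sine wave over 800 samples.
clean = [math.sin(2 * math.pi * 5 * t / 800) for t in range(800)]
training = multicondition_set(clean, snrs_db=[20, 10, 0])
for snr, noisy in training.items():
    print(snr, round(measured_snr_db(clean, noisy), 6))

# One subband (-50.0) is badly mismatched and is excluded from the score.
print(best_subband_score([-1.0, -50.0, -2.0, -1.5], keep=3))
```

In this sketch, training on the noisy copies plays the role of multicondition training, while `best_subband_score` shows how a score can ignore the subbands that remain mismatched after that training.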

Bibliographic details

Source: IEEE Transactions on Audio, Speech and Language Processing, 2006, No. 3, pp. 833-844 (12 pages)
Author: Ji Ming
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology
Keywords: noise; speech recognition; arbitrary additive noise; fullband spectral corruption; missing-feature method; multicondition training method; noise compensation; partial-band spectral corruption; speech representation; temporal-spectral characteri
