Time-frequency correlation based missing-feature reconstruction for robust speech recognition in background noise conditions

机译：基于时频相关的缺失特征重建在背景噪声条件下的鲁棒语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This study proposes a novel missing-feature reconstruction method to improve speech recognition in background noise environments. In order to improve the existing missing-feature reconstruction method which utilizes only frequency correlation, a temporal spectral feature analysis is employed to leverage temporal correlation across neighboring frames. The final estimates for missing-feature reconstruction are obtained by a selective combination of the frequency correlation based method and the proposed temporal correlation based method. Performance of the proposed method is evaluated using the Aurora 2.0 framework with car noise and speech babble conditions. Experimental results demonstrate that the proposed method is more effective at increasing speech recognition performance in adverse conditions. By employing the proposed temporal-frequency based reconstruction method with SNR-based mask estimation, +21.31% and +20.73% average relative improvements in WER are obtained for car and speech babble conditions, compared to the original frequency correlation based method.

机译：这项研究提出了一种新颖的缺失特征重建方法，以改善背景噪声环境下的语音识别。为了改进现有的仅利用频率相关性的缺失特征重建方法，采用时间频谱特征分析来利用跨相邻帧的时间相关性。通过基于频率相关的方法和建议的基于时间相关的方法的选择性组合，可以获得缺失特征重构的最终估计值。使用Aurora 2.0框架在汽车噪音和语音ba语条件下评估了所提出方法的性能。实验结果表明，该方法在不利条件下能有效提高语音识别性能。与原始的基于频率相关性的方法相比，通过采用所提出的基于时频的重建方法和基于SNR的掩码估计，在汽车和语音ba语条件下，WER的平均相对改善率分别为+ 21.31％和+ 20.73％。

著录项

来源
《Asilomar Conference on Signals, Systems and Computers》|2009年|P.1762-1765|共4页
会议地点
作者
Kim Wooil; Hansen John H.L.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信号处理;
关键词

相似文献

外文文献
中文文献
专利

1. Time–Frequency Correlation-Based Missing-Feature Reconstruction for Robust Speech Recognition in Band-Restricted Conditions [J] . Kim W., Hansen J. H. L. Audio, Speech, and Language Processing, IEEE Transactions on . 2009,第7期

机译：基于时频相关的丢失特征重建在频带受限条件下的鲁棒语音识别
2. MMSE-Based Missing-Feature Reconstruction With Temporal Modeling for Robust Speech Recognition [J] . Gonzalez J. A., Peinado A. M., Ma N., Audio, Speech, and Language Processing, IEEE Transactions on . 2013,第3期

机译：基于MMSE的缺失特征重建与时间建模，用于鲁棒语音识别
3. Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition [J] . Gonzalez Jose A., Gomez Angel M., Peinado Antonio M., Circuits, systems, and signal processing . 2017,第9期

机译：基于掩蔽模型的谱重构和噪声模型估计，用于噪声鲁棒语音识别
4. Mask estimation employing Posterior-based Representative Mean for missing-feature speech recognition with time-varying background noise [C] . Kim Wooil, Hansen John H.L. Automatic Speech Recognition amp; Understanding, 2009. ASRU 2009 . 2009

机译：时变背景噪声下基于后验的代表均值的语音特征缺失语音识别的掩码估计
5. Multi-microphone correlation-based processing for robust automatic speech recognition. [D] . Sullivan, Thomas M. 1996

机译：基于多麦克风相关性的处理可实现强大的自动语音识别。
6. Speech Perception for Adult Cochlear Implant Recipients in a Realistic Background Noise: Effectiveness of Preprocessing Strategies and External Options for Improving Speech Recognition in Noise [O] . René H. Gifford, Lawrence J. Revit -1

机译：成人耳蜗植入者在现实背景噪声中的言语感知：预处理策略和外部选择改善噪声语音识别的有效性
7. Improved Autocorrelation-Based Noise Robust Speech Recognition Using Kernel-Based Cross Correlation and Overestimation Parameters [O] . Farahani Gholamreza, Ahadi Mohammad, Homayounpour M. Mehdi 2007

机译：使用基于核的互相关和高估参数的改进的基于自相关的噪声鲁棒语音识别

Time-frequency correlation based missing-feature reconstruction for robust speech recognition in background noise conditions

摘要

著录项

相似文献

相关主题

期刊订阅