首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >Mask Estimation in Non-stationary Noise Environments for Missing Feature Based Robust Speech Recognition
【24h】

Mask Estimation in Non-stationary Noise Environments for Missing Feature Based Robust Speech Recognition

机译:基于丢失特征的鲁棒语音识别的非平稳噪声环境中的掩模估计

获取原文

摘要

In missing feature based automatic speech recognition (ASR), the role of the spectro-temporal mask in providing an accurate description of the relationship between target speech and environmental noise is critical for minimizing the degradation in ASR word accuracy (WAC) as the signal-to-noise ratio (SNR) decreases. This paper demonstrates the importance of accurate characterization of instantaneous acoustic background for mask estimation in data imputation approaches to missing feature based ASR, especially in the presence of non-stationary background noise. Mask estimation relies on a hypothesis test designed to detect the presence of speech in time-frequency spectral bins under rapidly varying noise conditions. Masked mel-frequency filter bank energies are reconstructed using a minimum mean squared error (MMSE) based data imputation procedure. The impact of this mask estimation approach is evaluated in the context of MMSE based data imputation under multiple background conditions over a range of SNRs using the Aurora 2 speech corpus.
机译:在缺少基于特征的自动语音识别(ASR)的过程中,频谱时域掩码在准确描述目标语音与环境噪声之间的关系中的作用对于最大程度地降低ASR字准确度(WAC)的下降至关重要,因为信噪比(SNR)降低。本文证明了准确表征瞬时声学背景对于在基于缺失特征的ASR的数据归算方法中进行掩码估计的重要性,尤其是在存在非平稳背景噪声的情况下。掩码估计依赖于一种假设检验,该假设检验旨在在快速变化的噪声条件下检测时频频谱仓中语音的存在。使用基于最小均方误差(MMSE)的数据插补程序来重建掩蔽的梅尔频率滤波器组能量。使用Aurora 2语音语料库,在多种背景条件下,在一定范围的SNR范围内,基于MMSE的数据插补中评估了此掩码估计方法的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号