Mask Estimation in Non-stationary Noise Environments for Missing Feature Based Robust Speech Recognition

机译：基于丢失特征的鲁棒语音识别的非平稳噪声环境中的掩模估计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In missing feature based automatic speech recognition (ASR), the role of the spectro-temporal mask in providing an accurate description of the relationship between target speech and environmental noise is critical for minimizing the degradation in ASR word accuracy (WAC) as the signal-to-noise ratio (SNR) decreases. This paper demonstrates the importance of accurate characterization of instantaneous acoustic background for mask estimation in data imputation approaches to missing feature based ASR, especially in the presence of non-stationary background noise. Mask estimation relies on a hypothesis test designed to detect the presence of speech in time-frequency spectral bins under rapidly varying noise conditions. Masked mel-frequency filter bank energies are reconstructed using a minimum mean squared error (MMSE) based data imputation procedure. The impact of this mask estimation approach is evaluated in the context of MMSE based data imputation under multiple background conditions over a range of SNRs using the Aurora 2 speech corpus.

机译：在缺少基于特征的自动语音识别（ASR）的过程中，频谱时域掩码在准确描述目标语音与环境噪声之间的关系中的作用对于最大程度地降低ASR字准确度（WAC）的下降至关重要，因为信噪比（SNR）降低。本文证明了准确表征瞬时声学背景对于在基于缺失特征的ASR的数据归算方法中进行掩码估计的重要性，尤其是在存在非平稳背景噪声的情况下。掩码估计依赖于一种假设检验，该假设检验旨在在快速变化的噪声条件下检测时频频谱仓中语音的存在。使用基于最小均方误差（MMSE）的数据插补程序来重建掩蔽的梅尔频率滤波器组能量。使用Aurora 2语音语料库，在多种背景条件下，在一定范围的SNR范围内，基于MMSE的数据插补中评估了此掩码估计方法的影响。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2010》|2011年|p.2062-2065|共4页
会议地点
作者
Shirin Badiezadegan; Richard C. Rose;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
automatic speech recognition; missing feature techniques; soft mask estimation; spectrogram reconstruction;

机译：自动语音识别;缺少特征技术;软掩模估计;频谱图重建;

相似文献

外文文献
中文文献
专利

1. Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition [J] . Gonzalez Jose A., Gomez Angel M., Peinado Antonio M., Circuits, systems, and signal processing . 2017,第9期

机译：基于掩蔽模型的谱重构和噪声模型估计，用于噪声鲁棒语音识别
2. A Novel Mask Estimation Method Employing Posterior-Based Representative Mean Estimate for Missing-Feature Speech Recognition [J] . Wooil Kim, Hansen J.H.L. Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第5期

机译：一种基于后验的代表性均值估计的新的掩模估计方法用于特征缺失语音识别
3. Feature classification criterion for missing features mask estimation in robust speaker recognition - Springer [J] . Dayana Ribas González, José Ramón Calvo de Lara Signal, Image and Video Processing . 2014,第2期

机译：健壮的说话人识别中缺少特征蒙版估计的特征分类标准-Springer
4. Mask Estimation in Non-stationary Noise Environments for Missing Feature Based Robust Speech Recognition [C] . Shirin Badiezadegan, Richard C. Rose Annual conference of the International Speech Communication Association . 2010

机译：基于缺失的功能的强大语音识别的非静止噪声环境中的掩模估计
5. Duration normalization for robust recognition of spontaneous speech via missing feature methods. [D] . Nedel, Jon P. 2004

机译：持续时间归一化，可通过缺失特征方法对自发语音进行可靠识别。
6. A Multistream Feature Framework Based on Bandpass Modulation Filtering for Robust Speech Recognition [O] . Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali -1

机译：在带通滤波调制多流功能根据框架鲁棒语音识别
7. Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition [O] . Gonzalez, J.A., Gómez, A.M., Peinado, A.M., 2017

机译：基于掩蔽模型的噪声鲁棒语音识别谱重建与噪声模型估计

Mask Estimation in Non-stationary Noise Environments for Missing Feature Based Robust Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅