Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition

机译：连接具有听觉启发性表示的光谱时滤波器，以实现强大的自动语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Spectro-temporal filtering has been shown to result in features that can help to increase the robustness of automatic speech recognition (ASR) in the past. We replace the spectro-temporal representation used in previous work with spectrograms that incorporate knowledge about the signal processing of the human auditory system and which are derived from Power-Normalized Cep-stral Coefficients (PNCCs). 2D-Gabor filters are applied to these spectrograms to extract features evaluated on a noisy digit recognition task. The filter bank is adapted to the new representation by optimizing the spectral modulation frequencies associated with each Gabor function. A comparison of optimized parameters and the spectral modulation of vowels shows a good match between optimized and expected range of frequencies. When processed with a non-linear neural net and combined with PNCCs, Gabor features decrease the error rate compared to the baseline and PNCCs by at least 19%.

机译：时空滤波已显示出可以帮助提高过去自动语音识别（ASR）鲁棒性的功能。我们用合并了有关人类听觉系统信号处理知识的频谱图替换了以前工作中使用的频谱时间表示形式，这些频谱图是从功率归一化倒谱系数（PNCC）得出的。将2D-Gabor滤波器应用于这些频谱图，以提取在嘈杂的数字识别任务上评估的特征。通过优化与每个Gabor函数相关的频谱调制频率，可以使滤波器组适应新的表示形式。优化参数和元音频谱调制的比较显示优化和预期频率范围之间的良好匹配。当使用非线性神经网络处理并与PNCC结合使用时，与基线和PNCC相比，Gabor特征可将错误率降低至少19％。

著录项

来源
《Annual conference of the International Speech Communication Association》|2012年|1258-1261|共4页
会议地点
作者
Bernd T. Meyer; Constantin Spille; Birger Kollmeier; Nelson Morgan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
automatic speech recognition; spectro-temporal features; power-normalized features;

机译：自动语音识别;光谱时态特征功率归一化特征;

相似文献

外文文献
中文文献
专利

1. Spectro-temporal modulation subspace-spanning filter bank features for robust automatic speech recognition [J] . Marc René Sch?dler, Bernd T. Meyer, Birger Kollmeier The Journal of the Acoustical Society of America . 2012,第5期

机译：频谱时间调制子空间跨度滤波器组功能，用于强大的自动语音识别
2. Auditory-Inspired Morphological Processing of Speech Spectrograms: Applications in Automatic Speech Recognition and Speech Enhancement [J] . Joyner Cadore, Francisco J. Valverde-Albacete, Ascensión Gallardo-Antolín, Cognitive Computation . 2013,第4期

机译：语音频谱图的听觉启发式形态处理：在自动语音识别和语音增强中的应用
3. Auditory-Inspired Morphological Processing of Speech Spectrograms: Applications in Automatic Speech Recognition and Speech Enhancement [J] . Joyner Cadore, Francisco J. Valverde-Albacete, Ascensión Gallardo-Antolín, Cognitive computation . 2013,第4期

机译：语音频谱图的听觉启发式形态处理：在自动语音识别和语音增强中的应用
4. Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition [C] . Bernd T.Meyer, Constantin Spille, Birger Kollmeier, INTERSPEECH 2012 . 2012

机译：挂接具有听觉激发的稳健性的引擎的频谱时间过滤器，用于强大的自动语音识别
5. Array-based Spectro-temporal Masking for Automatic Speech Recognition. [D] . Moghimi, Amir R. 2014

机译：基于阵列的频谱时域掩蔽，用于自动语音识别。
6. New Features Using Robust MVDR Spectrum of Filtered Autocorrelation Sequence for Robust Speech Recognition [O] . Sanaz Seyedin, Seyed Mohammad Ahadi, Saeed Gazor 2013

机译：使用滤波自相关序列的鲁棒MVDR频谱进行鲁棒语音识别的新功能
7. Auditory-inspired morphological processing of speech spectrograms: applications in automatic speech recognition and speech enhancement [O] . Cadore Joyner, Valverde-Albacete Francisco J., Gallardo-Antolín Ascensión, 2012

机译：听觉启发的语音频谱图形态处理：自动语音识别和语音增强中的应用

Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅