首页> 外文会议>INTERSPEECH 2012 >Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition

【24h】

Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition

机译：挂接具有听觉激发的稳健性的引擎的频谱时间过滤器，用于强大的自动语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Spectro-temporal filtering has been shown to result in features that can help to increase the robustness of automatic speech recognition (ASR) in the past. We replace the spectro-temporal representation used in previous work with spectrograms that incorporate knowledge about the signal processing of the human auditory system and which are derived from Power-Normalized Cep-stral Coefficients (PNCCs). 2D-Gabor filters are applied to these spectrograms to extract features- evaluated on a noisy digit recognition task. The filter bank is adapted to the new representation by optimizing the spectral m_odu-lation frequencies associated with each Gabor function. A comparison of optimized parameters and the spectral modulation of vowels shows a good match between optimized and expected range of frequencies. When processed with a non-linear neural net and combined with PNCCs, Gabor features decrease the error rate compared to the baseline and PNCCs by at least 19%.

机译：已经显示光谱 - 时间滤波导致可以有助于增加过去自动语音识别（ASR）的鲁棒性的功能。我们替换以前的工作中使用的光谱 - 时间表示与谱图，该谱图包含关于人类听觉系统的信号处理的知识，并且源自功率归一化的Cep-频系数（PNCC）。将2D-Gabor滤波器应用于这些谱图中以提取在嘈杂的数字识别任务上进行的特征。通过优化与每个Gabor函数相关联的光谱M_ODU-Latives频率，滤波器组适用于新的表示。优化参数的比较和元音的光谱调制显示优化和预期频率之间的良好匹配。当用非线性神经网络处理并与PNCC组合处理时，Gabor特征与基线和PNCC相比减少了至少19％的错误率。

著录项

来源
《INTERSPEECH 2012》|2012年||共4页
会议地点
作者
Bernd T.Meyer; Constantin Spille; Birger Kollmeier; Nelson Morgan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 73.4136083;
关键词
automatic speech recognition; spectrotemporal features; power-normalized features;

机译：自动语音识别;分光仪功能;功率标准化功能;

相似文献

外文文献
中文文献
专利

1. Spectro-temporal modulation subspace-spanning filter bank features for robust automatic speech recognition [J] . Marc René Sch?dler, Bernd T. Meyer, Birger Kollmeier The Journal of the Acoustical Society of America . 2012,第5期

机译：频谱时间调制子空间跨度滤波器组功能，用于强大的自动语音识别
2. Auditory-Inspired Morphological Processing of Speech Spectrograms: Applications in Automatic Speech Recognition and Speech Enhancement [J] . Joyner Cadore, Francisco J. Valverde-Albacete, Ascensión Gallardo-Antolín, Cognitive Computation . 2013,第4期

机译：语音频谱图的听觉启发式形态处理：在自动语音识别和语音增强中的应用
3. Auditory-Inspired Morphological Processing of Speech Spectrograms: Applications in Automatic Speech Recognition and Speech Enhancement [J] . Joyner Cadore, Francisco J. Valverde-Albacete, Ascensión Gallardo-Antolín, Cognitive computation . 2013,第4期

机译：语音频谱图的听觉启发式形态处理：在自动语音识别和语音增强中的应用
4. Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition [C] . Bernd T. Meyer, Constantin Spille, Birger Kollmeier, Annual conference of the International Speech Communication Association . 2012

机译：连接具有听觉启发性表示的光谱时滤波器，以实现强大的自动语音识别
5. Array-based Spectro-temporal Masking for Automatic Speech Recognition. [D] . Moghimi, Amir R. 2014

机译：基于阵列的频谱时域掩蔽，用于自动语音识别。
6. New Features Using Robust MVDR Spectrum of Filtered Autocorrelation Sequence for Robust Speech Recognition [O] . Sanaz Seyedin, Seyed Mohammad Ahadi, Saeed Gazor 2013

机译：使用滤波自相关序列的鲁棒MVDR频谱进行鲁棒语音识别的新功能
7. Auditory-inspired morphological processing of speech spectrograms: applications in automatic speech recognition and speech enhancement [O] . Cadore Joyner, Valverde-Albacete Francisco J., Gallardo-Antolín Ascensión, 2012

机译：听觉启发的语音频谱图形态处理：自动语音识别和语音增强中的应用

Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅