首页> 外文会议>INTERSPEECH 2012 >Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition
【24h】

Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition

机译:挂接具有听觉激发的稳健性的引擎的频谱时间过滤器,用于强大的自动语音识别

获取原文

摘要

Spectro-temporal filtering has been shown to result in features that can help to increase the robustness of automatic speech recognition (ASR) in the past. We replace the spectro-temporal representation used in previous work with spectrograms that incorporate knowledge about the signal processing of the human auditory system and which are derived from Power-Normalized Cep-stral Coefficients (PNCCs). 2D-Gabor filters are applied to these spectrograms to extract features- evaluated on a noisy digit recognition task. The filter bank is adapted to the new representation by optimizing the spectral m_odu-lation frequencies associated with each Gabor function. A comparison of optimized parameters and the spectral modulation of vowels shows a good match between optimized and expected range of frequencies. When processed with a non-linear neural net and combined with PNCCs, Gabor features decrease the error rate compared to the baseline and PNCCs by at least 19%.
机译:已经显示光谱 - 时间滤波导致可以有助于增加过去自动语音识别(ASR)的鲁棒性的功能。我们替换以前的工作中使用的光谱 - 时间表示与谱图,该谱图包含关于人类听觉系统的信号处理的知识,并且源自功率归一化的Cep-频系数(PNCC)。将2D-Gabor滤波器应用于这些谱图中以提取在嘈杂的数字识别任务上进行的特征。通过优化与每个Gabor函数相关联的光谱M_ODU-Latives频率,滤波器组适用于新的表示。优化参数的比较和元音的光谱调制显示优化和预期频率之间的良好匹配。当用非线性神经网络处理并与PNCC组合处理时,Gabor特征与基线和PNCC相比减少了至少19%的错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号