首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >On using the auditory image model and invariant-integration for noise robust automatic speech recognition
【24h】

On using the auditory image model and invariant-integration for noise robust automatic speech recognition

机译:利用听觉图像模型和不变积分进行噪声鲁棒的自动语音识别

获取原文
获取原文并翻译 | 示例

摘要

Commonly used feature extraction methods for automatic speech recognition (ASR) incorporate only rudimentary psychoacoustic findings. Several works showed that a physiologically closer auditory processing during the feature extraction stage can enhance the robustness of an ASR system in noisy environments. The “auditory image model” (AIM) is such a more sophisticated computational model. In this work we show how invariant integration can be applied in the feature space given by the AIM, and we analyze the performance of the resulting features under noisy conditions on the Aurora-2 task. Furthermore, we show that previously presented features based on power-normalization and invariant integration benefit from the AIM-based integration features when the feature vectors are combined with each other.
机译:常用的自动语音识别(ASR)特征提取方法仅包含基本的心理声学发现。多项工作表明,在特征提取阶段进行生理上更紧密的听觉处理可以增强ASR系统在嘈杂环境中的鲁棒性。 “听觉图像模型”(AIM)是一种更复杂的计算模型。在这项工作中,我们展示了如何在AIM给定的特征空间中应用不变积分,并且我们分析了Aurora-2任务在嘈杂条件下所得特征的性能。此外,我们表明,当特征向量彼此组合时,先前基于功率归一化和不变积分的特征将受益于基于AIM的积分特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号