首页> 外文会议>IEEE Workshop on Automatic Speech Recognition and Understanding >A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition
【24h】

A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition

机译:CHiME-3挑战系统:长期的声学功能可实现强大的噪声自动语音识别

获取原文

摘要

The paper describes an automatic speech recognition (ASR) system for the 3rd CHiME challenge that addresses noisy acoustic scenes within public environments. The proposed system includes a multi-channel speech enhancement front-end including a microphone channel failure detection method that is based on cross-comparing the modulation spectra of speech to detect erroneous microphone recordings. The main focus of the submission is the investigation of the amplitude modulation filter bank (AMFB) as a method to extract long-term acoustic cues prior to a Gaussian mixture model (GMM) or deep neural network (DNN) based ASR classifier. It is shown that AMFB features outperform the commonly used frame splicing technique of filter bank features even on a performance optimized ASR challenge system. I.e., temporal analysis of speech by hand-crafted and auditory motivated AMFBs is shown to be more robust compared to a data-driven method based on extracting temporal dynamics with a DNN. Our final ASR system, which additionally includes adaptation of acoustic features to speaker characteristics, achieves an absolute word error rate reduction of approx. 21.53 % relative to the best CHiME-3 baseline system on the "real" test condition.
机译:本文介绍了针对第三次CHiME挑战的自动语音识别(ASR)系统,该系统解决了公共环境中嘈杂的声学场景。所提出的系统包括多通道语音增强前端,该前端包括麦克风通道故障检测方法,该方法基于交叉比较语音的调制频谱以检测错误的麦克风记录。提交的主要焦点是对调幅滤波器组(AMFB)的研究,该方法是一种在基于高斯混合模型(GMM)或基于深度神经网络(DNN)的ASR分类器之前提取长期声学提示的方法。结果表明,即使在性能经过优化的ASR质询系统上,AMFB功能也优于常用的滤波器组功能的帧拼接技术。即,与基于DNN提取时间动态的数据驱动方法相比,手工制作和听觉上有动机的AMFB对语音的时间分析显示出更强大的功能。我们最终的ASR系统还包括声学特性与扬声器特性的匹配,可将绝对误码率降低约5%。相对于“真实”测试条件下的最佳CHiME-3基线系统而言,为21.53%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号