首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Learning feature mapping using deep neural network bottleneck features for distant large vocabulary speech recognition
【24h】

Learning feature mapping using deep neural network bottleneck features for distant large vocabulary speech recognition

机译:使用深度神经网络瓶颈特征进行学习特征映射以实现远距离大词汇量语音识别

获取原文

摘要

Automatic speech recognition from distant microphones is a difficult task because recordings are affected by reverberation and background noise. First, the application of the deep neural network (DNN)/hidden Markov model (HMM) hybrid acoustic models for distant speech recognition task using AMI meeting corpus is investigated. This paper then proposes a feature transformation for removing reverberation and background noise artefacts from bottleneck features using DNN trained to learn the mapping between distant-talking speech features and close-talking speech bottleneck features. Experimental results on AMI meeting corpus reveal that the mismatch between close-talking and distant-talking conditions is largely reduced, with about 16% relative improvement over conventional bottleneck system (trained on close-talking speech). If the feature mapping is applied to close-talking speech, a minor degradation of 4% relative is observed.
机译:来自远处麦克风的自动语音识别是一项艰巨的任务,因为录音会受到混响和背景噪声的影响。首先,研究了深度神经网络(DNN)/隐马尔可夫模型(HMM)混合声学模型在使用AMI会议语料的远程语音识别任务中的应用。然后,本文提出了一种特征变换,该特征变换使用DNN进行训练,以消除瓶颈特征中的混响和背景噪声伪像,该DNN可以学习远距离语音特征和近距离语音瓶颈特征之间的映射。在AMI会议语料库上的实验结果表明,近距离交谈和远距离交谈条件之间的失配已大大降低,与传统的瓶颈系统(在近距离交谈语音上进行训练)相比,相对改善了约16%。如果将特征映射应用于近距离讲话,则会观察到相对降低4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号