首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >SPEECH FEATURE DENOISING AND DEREVERBERATION VIA DEEP AUTOENCODERS FOR NOISY REVERBERANT SPEECH RECOGNITION
【24h】

SPEECH FEATURE DENOISING AND DEREVERBERATION VIA DEEP AUTOENCODERS FOR NOISY REVERBERANT SPEECH RECOGNITION

机译:通过深度自动控制仪进行嘈杂的混响语音识别的语音功能去噪

获取原文

摘要

Denoising autoencoders (DAs) have shown success in generating robust features for images, but there has been limited work in applying DAs for speech. In this paper we present a deep denoising autoencoder (DDA) framework that can produce robust speech features for noisy reverberant speech recognition. The DDA is first pre-trained as restricted Boltzmann machines (RBMs) in an unsupervised fashion. Then it is unrolled to autoencoders, and fine-tuned by corresponding clean speech features to learn a nonlinear mapping from noisy to clean features. Acoustic models are re-trained using the reconstructed features from the DDA, and speech recognition is performed. The proposed approach is evaluated on the CHiME-WSJO corpus, and shows a 16-25% absolute improvement on the recognition accuracy under various SNRs.
机译:去噪AutoEncoders(DAS)在为图像产生强大功能方面表现出成功,但在应用DAS进行语音方面存在有限的工作。在本文中,我们提出了一个深深的自动化器(DDA)框架,可以为嘈杂的混响语音识别产生强大的语音功能。 DDA首先以无监督的方式预先培训为受限制的Boltzmann机器(RBMS)。然后它展开到AutoEncoders,并通过相应的清洁语音特征进行微调,以学习从嘈杂到清洁功能的非线性映射。使用来自DDA的重建特征重新训练声学模型,并执行语音识别。所提出的方法是在Chime-WSJO语料库上进行评估,并显示各种SNRS下识别准确性的16-25%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号