SPEECH FEATURE DENOISING AND DEREVERBERATION VIA DEEP AUTOENCODERS FOR NOISY REVERBERANT SPEECH RECOGNITION


Abstract

Denoising autoencoders (DAs) have shown success in generating robust features for images, but there has been limited work on applying DAs to speech. In this paper we present a deep denoising autoencoder (DDA) framework that can produce robust speech features for noisy reverberant speech recognition. The DDA is first pre-trained as restricted Boltzmann machines (RBMs) in an unsupervised fashion. It is then unrolled into an autoencoder and fine-tuned against the corresponding clean speech features to learn a nonlinear mapping from noisy to clean features. Acoustic models are re-trained using the reconstructed features from the DDA, and speech recognition is performed. The proposed approach is evaluated on the CHiME-WSJ0 corpus and shows a 16-25% absolute improvement in recognition accuracy under various SNRs.
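The fine-tuning step described in the abstract, learning a nonlinear mapping from noisy to clean features, can be illustrated with a minimal NumPy sketch. Everything in it is an assumption made for illustration and not the authors' configuration: the data is synthetic, the feature dimension of 13 and the hidden size of 32 are arbitrary, the network has a single hidden layer, and the weights start from small random values where the paper initialises them from RBM pre-training. The function name `train_denoising_autoencoder` is hypothetical.

```python
import numpy as np

def train_denoising_autoencoder(noisy, clean, hidden=32, epochs=300, lr=0.5, seed=0):
    """Fit a one-hidden-layer autoencoder that maps noisy features to clean ones.

    Assumption: random initialisation stands in for the RBM pre-training
    described in the abstract, which this sketch omits.
    """
    rng = np.random.default_rng(seed)
    n_features = noisy.shape[1]
    W1 = rng.normal(0.0, 0.1, (n_features, hidden))
    b1 = np.zeros(hidden)
    W2 = rng.normal(0.0, 0.1, (hidden, n_features))
    b2 = np.zeros(n_features)
    losses = []
    for _ in range(epochs):
        h = np.tanh(noisy @ W1 + b1)        # encoder: nonlinear hidden layer
        out = h @ W2 + b2                   # decoder: linear reconstruction
        err = out - clean                   # target is the CLEAN feature vector
        losses.append(float(np.mean(err ** 2)))
        # Backpropagate the mean squared error (full-batch gradient descent).
        g_out = 2.0 * err / err.size
        g_W2 = h.T @ g_out
        g_b2 = g_out.sum(axis=0)
        g_h = (g_out @ W2.T) * (1.0 - h ** 2)
        g_W1 = noisy.T @ g_h
        g_b1 = g_h.sum(axis=0)
        W1 -= lr * g_W1
        b1 -= lr * g_b1
        W2 -= lr * g_W2
        b2 -= lr * g_b2

    def denoise(x):
        """Apply the trained mapping to new noisy feature vectors."""
        return np.tanh(x @ W1 + b1) @ W2 + b2

    return denoise, losses

# Synthetic stand-in for paired (noisy, clean) feature frames.
rng = np.random.default_rng(42)
clean_feats = rng.normal(size=(512, 4)) @ rng.normal(size=(4, 13))
noisy_feats = clean_feats + 0.3 * rng.normal(size=clean_feats.shape)

denoise, losses = train_denoising_autoencoder(noisy_feats, clean_feats)
print(f"MSE before training: {losses[0]:.3f}, after: {losses[-1]:.3f}")
```

In the pipeline the abstract outlines, the output of `denoise` would play the role of the reconstructed features on which the acoustic models are re-trained; that stage is outside the scope of this sketch.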


Bibliographic details

Source: IEEE International Conference on Acoustics, Speech and Signal Processing, 2014, 5 pages
Authors: Xue Feng; Yaodong Zhang; James Glass
Original format: PDF
Chinese Library Classification: TN912-53

Similar literature


1. Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature [J]. Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara. EURASIP Journal on Advances in Signal Processing. 2015, No. 1.

2. Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization [J]. Ueda Yuma, Wang Longbiao, Kai Atsuhiko. Journal of Signal Processing Systems for Signal, Image, and Video Technology. 2016, No. 2.

3. Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation [J]. Xiong Xiao, Shengkui Zhao, Duc Hoang Ha Nguyen. EURASIP Journal on Advances in Signal Processing. 2016, No. 1.

4. Speech feature denoising and dereverberation via deep autoencoders for noisy reverberant speech recognition [C]. Feng Xue, Zhang Yaodong, Glass James. IEEE International Conference on Acoustics, Speech and Signal Processing. 2014.

5. Deep Learning Methods for Reverberant and Noisy Speech Enhancement [D]. Zhao, Yan. 2020.

6. A deep learning based segregation algorithm to increase speech intelligibility for hearing-impaired listeners in reverberant-noisy conditions [O]. Yan Zhao, DeLiang Wang, Eric M. Johnson.

7. Speech feature denoising and dereverberation via deep autoencoders for noisy reverberant speech recognition [O]. Xue Feng, Yaodong Zhang, James Glass. 2014.
