Speech Enhancement Method Based On LSTM Neural Network for Speech Recognition

机译：基于LSTM神经网络的语音增强语音识别方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Long Short-Term Memory (LSTM), a special kind of Recurrent Neural Network (RNN), is capable of learning long-term dependencies. In this paper, a kind of speech enhancement method is proposed for LSTM network structure to cope with the speech features, with the purpose of improving the speech recognition rate. This method utilizes the LSTM structure in reference to the acoustic model and crossover residual network to construct the front-end enhancement module. We trained and compared DNN, CNN, LSTM and BLSTM models with various numbers of parameters. The experimental results show that, the LSTM model performs the best in the test set and the real scene. The noise reduction effects are the best when the noise is reduced from 31.23% to 25.89% on the Xiaomi speaker test set¹.

机译：长期内记忆（LSTM），一种特殊的经常性神经网络（RNN），能够学习长期依赖性。在本文中，提出了一种语音增强方法，用于应对语音特征的LSTM网络结构，以提高语音识别率的目的。该方法利用LSTM结构参考声学模型和交叉剩余网络来构建前端增强模块。我们培训和比较了DNN，CNN，LSTM和BLSTM模型，具有各种参数。实验结果表明，LSTM模型在测试集和真实场景中执行最佳。当噪声减少到小迈扬声器测试集中的噪声从31.23 \％降低到25.89 \％时，降噪效果最佳 ^{1
。}

著录项

来源
《IEEE International Conference on Signal Processing》|2018年|245-249|共5页
会议地点
作者
Ming Liu; Yujun Wang; Jin Wang; Jing Wang; Xiang Xie;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Neural networks; Speech enhancement; Training; Speech recognition; Data models; Acoustics; Computer architecture;

机译：神经网络;语音增强;训练;语音识别;数据模型;声学;计算机体系结构;

相似文献

外文文献
中文文献
专利

1. Performance Evaluation of Deep Neural Networks Applied to Speech Recognition: RNN, LSTM and GRU [J] . Apeksha Shewalkar, Deepika Nyavanandi, Simone A. Ludwig Journal of Artificial Intelligence and Soft Computing Research . 2019,第4期

机译：深度神经网络在语音识别中的性能评估：RNN，LSTM和GRU
2. Combination of GMM-Based Speech Estimation Method and Temporal Domain SVD-Based Speech Enhancement for Noise Robust Speech Recognition [J] . Masakiyo Fujimoto, Yasuo Ariki Systems and Computers in Japan . 2007,第3期

机译：基于GMM的语音估计方法与基于时域SVD的语音增强相结合的噪声鲁棒语音识别
3. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
4. Speech Enhancement Method Based On LSTM Neural Network for Speech Recognition [C] . Ming Liu, Yujun Wang, Jin Wang, IEEE International Conference on Signal Processing . 2018

机译：基于LSTM神经网络的语音识别语音增强方法
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Speaker-Independent Silent Speech Recognition from Flesh-Point Articulatory Movements Using an LSTM NeuralNetwork [O] . Myungjong Kim, Beiming Cao, Ted Mau, -1

机译：使用LSTM神经从肉点发音运动中独立于说话者的沉默语音识别网络
7. EXEMPLAR-BASED SPEECH ENHANCEMENT FOR DEEP NEURAL NETWORK BASED AUTOMATIC SPEECH RECOGNITION [O] . Deepak Baby, Jort F. Gemmeke, Tuomas Virtanen, 2016

机译：基于示例的语音增强在深度神经网络自动语音识别中的应用

Speech Enhancement Method Based On LSTM Neural Network for Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅