首页> 外文会议>IEEE International Conference on Signal Processing >Speech Enhancement Method Based On LSTM Neural Network for Speech Recognition
【24h】

Speech Enhancement Method Based On LSTM Neural Network for Speech Recognition

机译:基于LSTM神经网络的语音增强语音识别方法

获取原文

摘要

Long Short-Term Memory (LSTM), a special kind of Recurrent Neural Network (RNN), is capable of learning long-term dependencies. In this paper, a kind of speech enhancement method is proposed for LSTM network structure to cope with the speech features, with the purpose of improving the speech recognition rate. This method utilizes the LSTM structure in reference to the acoustic model and crossover residual network to construct the front-end enhancement module. We trained and compared DNN, CNN, LSTM and BLSTM models with various numbers of parameters. The experimental results show that, the LSTM model performs the best in the test set and the real scene. The noise reduction effects are the best when the noise is reduced from 31.23% to 25.89% on the Xiaomi speaker test set1.
机译:长期内记忆(LSTM),一种特殊的经常性神经网络(RNN),能够学习长期依赖性。在本文中,提出了一种语音增强方法,用于应对语音特征的LSTM网络结构,以提高语音识别率的目的。该方法利用LSTM结构参考声学模型和交叉剩余网络来构建前端增强模块。我们培训和比较了DNN,CNN,LSTM和BLSTM模型,具有各种参数。实验结果表明,LSTM模型在测试集和真实场景中执行最佳。当噪声减少到小迈扬声器测试集中的噪声从31.23 \%降低到25.89 \%时,降噪效果最佳 1

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号