首页> 外文会议>IEEE International Conference on Robotics and Biomimetics >Maxout neurons based deep bidirectional LSTM for acoustic modeling
【24h】

Maxout neurons based deep bidirectional LSTM for acoustic modeling

机译:基于Maxout神经元的深度双向LSTM用于声学建模

获取原文

摘要

Recently long short-term memory (LSTM) recurrent neural networks (RNN) have achieved greater success in acoustic models for the large vocabulary continuous speech recognition system. In this paper, we propose an improved hybrid acoustic model based on deep bidirectional long short-term memory (DBLSTM) RNN. In this new acoustic model, maxout neurons are used in the fully-connected part of DBLSTM to solve the problems of vanishing and exploding gradient. At the same time, the dropout regularization algorithm is used to avoid the over-fitting during the training process of neural network. In addition, in order to adapt the bidirectional dependence of DBLSTM at each time step, a context-sensitive-chunk (CSC) back-propagation through time (BPTT) algorithm is proposed to train DBLSTM neural network. Simulation experiments have been made on Switchboard benchmark task. The results show that the WER of the improved hybrid acoustic model is 14.5%, and the optimal network structures and CSC configurations are given.
机译:最近,长短期记忆(LSTM)递归神经网络(RNN)在大型词汇连续语音识别系统的声学模型中取得了更大的成功。在本文中,我们提出了一种基于深度双向长短期记忆(DBLSTM)RNN的改进的混合声学模型。在这个新的声学模型中,在DBLSTM的完全连接部分中使用了maxout神经元来解决梯度消失和爆炸的问题。同时,采用丢包正则化算法避免神经网络训练过程中的过度拟合。此外,为了适应DBLSTM在每个时间步长上的双向依赖性,提出了一种上下文敏感块(CSC)随时间反向传播(BPTT)算法来训练DBLSTM神经网络。已经对Switchboard基准测试任务进行了仿真实验。结果表明,改进后的混合声学模型的WER为14.5%,并给出了最优的网络结构和CSC配置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号