Maxout neurons based deep bidirectional LSTM for acoustic modeling

机译：基于Maxout神经元的深度双向LSTM用于声学建模

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently long short-term memory (LSTM) recurrent neural networks (RNN) have achieved greater success in acoustic models for the large vocabulary continuous speech recognition system. In this paper, we propose an improved hybrid acoustic model based on deep bidirectional long short-term memory (DBLSTM) RNN. In this new acoustic model, maxout neurons are used in the fully-connected part of DBLSTM to solve the problems of vanishing and exploding gradient. At the same time, the dropout regularization algorithm is used to avoid the over-fitting during the training process of neural network. In addition, in order to adapt the bidirectional dependence of DBLSTM at each time step, a context-sensitive-chunk (CSC) back-propagation through time (BPTT) algorithm is proposed to train DBLSTM neural network. Simulation experiments have been made on Switchboard benchmark task. The results show that the WER of the improved hybrid acoustic model is 14.5%, and the optimal network structures and CSC configurations are given.

机译：最近，长短期记忆（LSTM）递归神经网络（RNN）在大型词汇连续语音识别系统的声学模型中取得了更大的成功。在本文中，我们提出了一种基于深度双向长短期记忆（DBLSTM）RNN的改进的混合声学模型。在这个新的声学模型中，在DBLSTM的完全连接部分中使用了maxout神经元来解决梯度消失和爆炸的问题。同时，采用丢包正则化算法避免神经网络训练过程中的过度拟合。此外，为了适应DBLSTM在每个时间步长上的双向依赖性，提出了一种上下文敏感块（CSC）随时间反向传播（BPTT）算法来训练DBLSTM神经网络。已经对Switchboard基准测试任务进行了仿真实验。结果表明，改进后的混合声学模型的WER为14.5％，并给出了最优的网络结构和CSC配置。

著录项

来源
《IEEE International Conference on Robotics and Biomimetics》|2017年|1599-1604|共6页
会议地点
作者
Yuan Luo; Yu Liu; Yi Zhang; Boyu Wang; Zhou Ye;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Neurons; Training; Acoustics; Hidden Markov models; Biological neural networks; Speech recognition; Logic gates;

机译：神经元;训练;声学;隐马尔可夫模型;生物神经网络;语音识别;逻辑门;

相似文献

外文文献
中文文献
专利

1. Training Deep Bidirectional LSTM Acoustic Model for LVCSR by a Context-Sensitive-Chunk BPTT Approach [J] . Kai Chen, Qiang Huo Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第7期

机译：通过上下文敏感块BPTT方法训练LVCSR的深度双向LSTM声学模型
2. Decomposition-based hybrid wind speed forecasting model using deep bidirectional LSTM networks [J] . Jaseena K. U., Kovoor Binsu C. Energy Conversion & Management . 2021,第Apra期

机译：基于分解的混合风速预测模型，使用深双向LSTM网络
3. A novel wavelet sequence based on deep bidirectional LSTM network model for ECG signal classification [J] . Yildirim Ozal Computers in Biology and Medicine . 2018,第期

机译：一种基于深双向LSTM网络模型的新型小波序列ECG信号分类
4. Maxout Neurons Based Deep Bidirectional LSTM for Acoustic Modeling [C] . Yuan Luo, Yu Liu, Yi Zhang, IEEE International Conference on Robotics and Biomimetics . 2017

机译：基于MAXOUT神经元的声学建模深双向LSTM
5. Deep Learning-Based Hosting Capacity Analysis in LV Distribution Grids with Spatial-Temporal LSTMs [D] . Wu, Jiaqi. 2021

机译：LV分布网的基于深度学习的托管能力分析，具有空间时间LSTMS
6. LSTMCNNsucc: A Bidirectional LSTM and CNN-Based Deep Learning Method for Predicting Lysine Succinylation Sites [O] . Guohua Huang, Qingfeng Shen, Guiyang Zhang, 2021

机译：LSTMCNNSUCC：一种预测赖氨酸琥珀酸位点的双向LSTM和基于CNN的深度学习方法
7. A Comprehensive Study of Deep Bidirectional LSTM RNNs for Acoustic Modeling in Speech Recognition [O] . Zeyer, Albert, Doetsch, Patrick, Voigtlaender, Paul, 2017

机译：声学系统深双向LsTm RNN的综合研究语音识别中的建模

Maxout neurons based deep bidirectional LSTM for acoustic modeling

摘要

著录项

相似文献

相关主题

期刊订阅