IEEE/CVF Conference on Computer Vision and Pattern Recognition

Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN

Abstract

Recurrent neural networks (RNNs) have been widely used for processing sequential data. However, RNNs are commonly difficult to train due to the well-known gradient vanishing and exploding problems, and they struggle to learn long-term patterns. Long short-term memory (LSTM) and gated recurrent unit (GRU) were developed to address these problems, but the use of hyperbolic tangent and sigmoid activation functions results in gradient decay over layers. Consequently, constructing an efficiently trainable deep network is challenging. In addition, all the neurons in an RNN layer are entangled together and their behaviour is hard to interpret. To address these problems, a new type of RNN, referred to as independently recurrent neural network (IndRNN), is proposed in this paper, where neurons in the same layer are independent of each other and are connected across layers. We have shown that an IndRNN can be easily regulated to prevent the gradient exploding and vanishing problems while allowing the network to learn long-term dependencies. Moreover, an IndRNN can work with non-saturated activation functions such as ReLU (rectified linear unit) and still be trained robustly. Multiple IndRNNs can be stacked to construct a network that is deeper than existing RNNs. Experimental results have shown that the proposed IndRNN is able to process very long sequences (over 5000 time steps), can be used to construct very deep networks (21 layers used in the experiments), and can still be trained robustly. Better performance has been achieved on various tasks with IndRNNs than with the traditional RNN and LSTM.
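For concreteness, the recurrence behind this independence is h_t = σ(W x_t + u ⊙ h_{t-1} + b), where the recurrent weight u is a vector rather than a matrix, so each neuron receives only its own previous hidden state. The following minimal NumPy sketch of one IndRNN layer with a ReLU activation is illustrative only; the dimensions, random initialization, and the simple clipping of u used to bound the recurrent gain are assumptions for the sketch, not the paper's exact training setup.

    import numpy as np

    def indrnn_step(x_t, h_prev, W, u, b):
        # One IndRNN step: each neuron n has a scalar recurrent weight
        # u[n], so its state depends only on its own previous state:
        # h_t = relu(W @ x_t + u * h_{t-1} + b)
        return np.maximum(0.0, W @ x_t + u * h_prev + b)

    # Toy dimensions and data, purely for illustration.
    rng = np.random.default_rng(0)
    n_in, n_hidden, T = 4, 8, 5000

    W = rng.normal(0.0, 0.1, size=(n_hidden, n_in))
    b = np.zeros(n_hidden)
    # Clipping keeps |u[n]| <= 1, one simple way (assumed here) to
    # regulate the per-neuron recurrent gain over long sequences.
    u = np.clip(rng.normal(0.0, 0.5, size=n_hidden), -1.0, 1.0)

    h = np.zeros(n_hidden)
    for t in range(T):
        x_t = rng.normal(size=n_in)
        h = indrnn_step(x_t, h, W, u, b)

Because the gradient of each neuron's state through time passes only through its own scalar u[n], bounding |u[n]| directly controls how the recurrent signal grows or decays over many time steps, which is what makes the regulation described in the abstract straightforward.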