Workshop on Representation Learning for NLP

Quantifying the vanishing gradient and long distance dependency problem in recursive neural networks and recursive LSTMs



Abstract

Recursive neural networks (RNNs) and their recently proposed extension, recursive long short-term memory networks (RLSTMs), are models that compute representations for sentences by recursively combining word embeddings according to an externally provided parse tree. Unlike recurrent networks, both models thus explicitly make use of the hierarchical structure of a sentence. In this paper, we demonstrate that RNNs nevertheless suffer from the vanishing gradient and long distance dependency problems, and that RLSTMs greatly improve over RNNs on these problems. We present an artificial learning task that allows us to quantify the severity of these problems for both models. We further show that the ratio of gradients at the root node and at a focal leaf node is highly indicative of how successfully backpropagation optimizes the relevant weights low in the tree. This paper thus provides an explanation for existing, superior results of RLSTMs on tasks such as sentiment analysis, and suggests that the benefits of including hierarchical structure and of including LSTM-style gating are complementary.
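The sketch below is a minimal illustration (not the paper's code) of the two ideas in the abstract: composing leaf embeddings bottom-up with a single shared tanh layer, as a plain recursive NN does, and then comparing the gradient norm at a deep "focal" leaf with the gradient norm at the root. The tree shape, dimensions, and toy loss are assumptions made only for the example.

```python
# Hypothetical sketch of the gradient-ratio diagnostic described in the abstract.
import torch
import torch.nn as nn

dim = 10
combine = nn.Linear(2 * dim, dim)          # shared composition weights

def compose(left, right):
    """Combine two child representations into a parent representation."""
    return torch.tanh(combine(torch.cat([left, right], dim=-1)))

# A deep, left-branching tree: the focal leaf sits far below the root.
depth = 8
focal_leaf = torch.randn(dim, requires_grad=True)
node = focal_leaf
for _ in range(depth):
    sibling = torch.randn(dim)              # other leaves need no gradient
    node = compose(node, sibling)
root = node
root.retain_grad()                          # keep the gradient at the root node

# Toy objective: push the root representation toward a fixed target.
loss = ((root - torch.ones(dim)) ** 2).sum()
loss.backward()

# Ratio of gradient norms at the focal leaf and at the root; a value near
# zero indicates the vanishing-gradient behaviour the paper quantifies.
ratio = focal_leaf.grad.norm() / root.grad.norm()
print(f"gradient ratio (leaf / root): {ratio.item():.3e}")
```

Increasing `depth` shrinks the ratio for the tanh composition; replacing `compose` with an LSTM-style gated composition would give the RLSTM counterpart whose improvement the paper quantifies.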
