Source: JMLR: Workshop and Conference Proceedings

Factorized Recurrent Neural Architectures for Longer Range Dependence



Abstract

The ability to capture Long Range Dependence (LRD) in a stochastic process is of prime importance in the context of predictive models. A sequential model with a longer-term memory is better able to contextualize recent observations. In this article, we apply the theory of LRD stochastic processes to modern recurrent architectures, such as LSTMs and GRUs, and prove that they do not provide LRD under assumptions sufficient for gradients to vanish. Motivated by an information-theoretic analysis, we provide a modified recurrent neural architecture that mitigates the issue of faulty memory through redundancy while keeping the compute time constant. Experimental results on a synthetic copy task, the YouTube-8M video classification task and a recommender system show that we enable better memorization and longer-term memory.
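For background (a standard definition, not quoted from this paper): a stationary process exhibits long range dependence when its autocorrelations decay so slowly that they are not summable, in contrast to the geometric decay characteristic of short-memory recurrent models.

```latex
% Standard definition of long range dependence (LRD) for a stationary
% process (X_t) with autocorrelation function \rho(k):
\sum_{k=1}^{\infty} |\rho(k)| = \infty
% typically arising from polynomial decay,
\rho(k) \sim c\,k^{-\alpha}, \qquad \alpha \in (0, 1),
% whereas short-memory processes have geometric decay,
% \rho(k) = O(r^k) with 0 < r < 1, so the sum above converges.
```

The abstract does not spell out the modified architecture, so the sketch below is only a generic illustration of storing memory redundantly at roughly constant compute: the hidden-state budget is split across several small independent GRU cells rather than one large cell, so information is held in parallel sub-states. The class name `RedundantGRU` and all design choices here are hypothetical illustrations, not the authors' method.

```python
import torch
import torch.nn as nn

class RedundantGRU(nn.Module):
    """Hypothetical sketch: redundancy via parallel small GRU sub-states.

    Splitting a hidden budget of size `hidden_size` across `n_cells`
    independent GRUCells keeps total state size (and compute) comparable
    to a single large cell, while storing information in parallel.
    """
    def __init__(self, input_size: int, hidden_size: int, n_cells: int = 4):
        super().__init__()
        assert hidden_size % n_cells == 0
        self.sub = hidden_size // n_cells
        self.cells = nn.ModuleList(
            nn.GRUCell(input_size, self.sub) for _ in range(n_cells)
        )

    def forward(self, x, hs=None):
        # x: (batch, input_size); hs: list of per-cell states (batch, sub)
        if hs is None:
            hs = [x.new_zeros(x.size(0), self.sub) for _ in self.cells]
        hs = [cell(x, h) for cell, h in zip(self.cells, hs)]
        return torch.cat(hs, dim=-1), hs

# Usage: step through a toy sequence, reading the concatenated state.
layer = RedundantGRU(input_size=32, hidden_size=64, n_cells=4)
seq = torch.randn(10, 8, 32)        # (time, batch, features)
state = None
for x_t in seq:
    out, state = layer(x_t, state)  # out: (8, 64)
```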
