JMLR: Workshop and Conference Proceedings

Learning Long Term Dependencies via Fourier Recurrent Units


Abstract

It is a known fact that training recurrent neural networks for tasks that have long term dependencies is challenging. One of the main reasons is the vanishing or exploding gradient problem, which prevents gradient information from propagating to early layers. In this paper we propose a simple recurrent architecture, the Fourier Recurrent Unit (FRU), that stabilizes the gradients that arise in its training while giving us stronger expressive power. Specifically, FRU summarizes the hidden states $h^{(t)}$ along the temporal dimension with Fourier basis functions. This allows gradients to easily reach any layer due to FRU’s residual learning structure and the global support of trigonometric functions. We show that FRU has gradient lower and upper bounds independent of the temporal dimension. We also show the strong expressivity of the sparse Fourier basis, from which FRU derives its expressive power. Our experimental study also demonstrates that with fewer parameters the proposed architecture outperforms other recurrent architectures on many tasks.
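The core idea of the abstract, summarizing hidden states along the temporal dimension with Fourier basis functions, can be sketched as follows. This is a minimal illustration, not the paper's exact recurrence: the function name `fru_fourier_summary` and the choice of a cosine basis with hypothetical frequencies are assumptions for demonstration only.

```python
import numpy as np

def fru_fourier_summary(hidden_states, freqs):
    """Summarize hidden states h^(1..T) with a cosine Fourier basis.

    hidden_states: array of shape (T, d), one hidden vector per time step.
    freqs: array of K frequencies (hypothetical choice for illustration).
    Returns an array of shape (K, d): one d-dimensional summary per frequency.
    """
    T, d = hidden_states.shape
    t = np.arange(1, T + 1)
    # basis[k, tau] = cos(2*pi * f_k * tau / T) -- trigonometric functions
    # have global support over the whole sequence, so every time step
    # contributes to every summary statistic.
    basis = np.cos(2 * np.pi * np.outer(freqs, t) / T)
    # u[k] = (1/T) * sum_tau basis[k, tau] * h^(tau)
    return basis @ hidden_states / T

rng = np.random.default_rng(0)
h = rng.standard_normal((50, 8))   # T=50 time steps, hidden size d=8
u = fru_fourier_summary(h, np.array([0.0, 1.0, 2.0, 4.0]))
print(u.shape)  # (4, 8)
```

Note that frequency 0 reduces to a plain temporal average of the hidden states, while nonzero frequencies capture how the hidden states oscillate over the sequence; gradients flow from each summary directly to every time step rather than only through the step-by-step recurrence.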
