Scalable Bayesian Learning of Recurrent Neural Networks for Language Modeling

Abstract

Recurrent neural networks (RNNs) have shown promising performance for language modeling. However, traditional training of RNNs using back-propagation through time often suffers from overfitting. One reason for this is that stochastic optimization (used for large training sets) does not provide good estimates of model uncertainty. This paper leverages recent advances in stochastic gradient Markov Chain Monte Carlo (also appropriate for large training sets) to learn weight uncertainty in RNNs. It yields a principled Bayesian learning algorithm, adding gradient noise during training (enhancing exploration of the model-parameter space) and model averaging when testing. Extensive experiments on various RNN models and across a broad range of applications demonstrate the superiority of the proposed approach relative to stochastic optimization.
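
The recipe described in the abstract, injecting properly scaled Gaussian noise into each stochastic-gradient update during training and averaging posterior weight samples at test time, corresponds to SG-MCMC methods such as stochastic gradient Langevin dynamics (SGLD). Below is a minimal PyTorch sketch of an SGLD-style update applied to a toy LSTM language model; the network sizes, learning rate, synthetic data, and sampling schedule are illustrative assumptions, not the paper's exact algorithm or configuration.

```python
# Minimal sketch of an SGLD-style update (one member of the SG-MCMC family)
# on a toy LSTM language model. Hyper-parameters and data are illustrative.
import copy
import math
import torch
import torch.nn as nn

torch.manual_seed(0)
vocab, emb, hid, seq_len, n_train = 50, 16, 32, 12, 1000

class TinyLM(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(vocab, emb)
        self.rnn = nn.LSTM(emb, hid, batch_first=True)
        self.out = nn.Linear(hid, vocab)

    def forward(self, x):
        h, _ = self.rnn(self.embed(x))
        return self.out(h)

model = TinyLM()
criterion = nn.CrossEntropyLoss()
lr, burn_in, thin = 1e-4, 50, 10
posterior_samples = []  # weight samples used for model averaging at test time

for step in range(200):
    # Synthetic mini-batch standing in for real language-model data.
    x = torch.randint(0, vocab, (8, seq_len))
    y = torch.randint(0, vocab, (8, seq_len))

    model.zero_grad()
    loss = criterion(model(x).reshape(-1, vocab), y.reshape(-1))
    loss.backward()

    with torch.no_grad():
        for p in model.parameters():
            # SGLD update (one common convention):
            #   theta <- theta - lr * grad(U) + N(0, 2 * lr)
            # The mini-batch gradient is rescaled by n_train so the likelihood
            # term approximates the full training set; a prior (weight-decay)
            # term could be added to the gradient as well.
            noise = torch.randn_like(p) * math.sqrt(2.0 * lr)
            p.add_(-lr * n_train * p.grad + noise)

    # After burn-in, keep thinned weight samples; test-time predictions
    # average the predictive distributions of these samples.
    if step > burn_in and step % thin == 0:
        posterior_samples.append(copy.deepcopy(model.state_dict()))
```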
