Neural networks are an important class of highly flexible and powerful models inspired by the structure of the brain. They consist of a sequence of interconnected layers, each composed of basic computational units similar to the gates of a classical circuit. Like circuits, they have the capacity to perform simple computational procedures, such as those which might underlie the generating process of the dataset they are trained on. The most popular and successful approach to learning neural networks is to optimize their parameters with respect to an objective function using standard methods for nonlinear optimization. Because basic methods like stochastic gradient descent (SGD) are often very slow for deeply layered neural networks, or for ones with recurrent connections, it is worthwhile to consider more advanced methods. In this thesis we review and analyze various such methods that have been proposed over the past few decades, with a particular focus on approximate-Newton/second-order methods, and develop two of our own, which we call Hessian-free optimization (HF) and Kronecker-factored Approximate Curvature (K-FAC). Our experiments show that K-FAC can be much faster in practice at optimizing deep neural networks than well-tuned SGD with momentum.
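To give a flavor of the second-order methods at issue, the following is an illustrative sketch of the Kronecker-factored approximation underlying K-FAC, written in standard notation rather than the notation fixed later in the thesis. K-FAC approximates the block of the Fisher information matrix associated with a layer's weight matrix $W_\ell$ by a Kronecker product,
\[
  F_\ell \;\approx\; A_{\ell-1} \otimes G_\ell,
  \qquad
  A_{\ell-1} = \mathrm{E}\big[\, a_{\ell-1} a_{\ell-1}^{\top} \big],
  \quad
  G_\ell = \mathrm{E}\big[\, g_\ell\, g_\ell^{\top} \big],
\]
where $a_{\ell-1}$ denotes the layer's input activation vector and $g_\ell$ the gradient of the loss with respect to the layer's pre-activations. Because of the identity $(A_{\ell-1} \otimes G_\ell)^{-1}\operatorname{vec}(V) = \operatorname{vec}\big(G_\ell^{-1}\, V\, A_{\ell-1}^{-1}\big)$, applying the approximate inverse curvature to a gradient reduces to two comparatively small matrix inversions per layer, which is what makes such an update practical at scale.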