Globally Optimal Training of Generalized Polynomial Neural Networks with Nonlinear Spectral Methods

机译：非线性谱方法的广义多项式神经网络的全局最优训练

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The optimization problem behind neural networks is highly non-convex. Training with stochastic gradient descent and variants requires careful parameter tuning and provides no guarantee to achieve the global optimum. In contrast we show under quite weak assumptions on the data that a particular class of feedforward neural networks can be trained globally optimal with a linear convergence rate with our nonlinear spectral method. Up to our knowledge this is the first practically feasible method which achieves such a guarantee. While the method can in principle be applied to deep networks, we restrict ourselves for simplicity in this paper to one and two hidden layer networks. Our experiments confirm that these models are rich enough to achieve good performance on a series of real-world datasets.

机译：神经网络背后的优化问题是高度非凸的。随机梯度下降和变化形式的训练需要仔细的参数调整，不能保证达到全局最优。相比之下，我们在非常弱的数据假设下表明，可以使用非线性谱方法以线性收敛速率对一类特定的前馈神经网络进行全局最优训练。据我们所知，这是第一个获得这种保证的切实可行的方法。虽然该方法原则上可以应用于深层网络，但为简单起见，我们在本文中将自身限制为一个和两个隐藏层网络。我们的实验证实，这些模型足够丰富，可以在一系列实际数据集中实现良好的性能。

著录项

来源
《Annual conference on Neural Information Processing Systems》|2016年|1695-1703|共9页
会议地点
作者
A. Gautier; Q. Nguyen; M. Hein;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. STOCHASTIC GENERALIZED GRADIENT METHODS FOR TRAINING NONCONVEX NONSMOOTH NEURAL NETWORKS [J] . Norkin V. I. Cybernetics and Systems Analysis . 2021,第5期

机译：用于训练非凸起非光华神经网络的随机广义梯度方法
2. Global exponential system of projection neural networks for system of generalized variational inequalities and related nonlinear minimax problems [J] . Qingshan Liu, Yongqing Yang Neurocomputing . 2010,第10a12期

机译：广义变分不等式和相关非线性极小极大问题的投影神经网络全局指数系统
3. Neural-Network-Based Optimal Control for a Class of Unknown Discrete-Time Nonlinear Systems Using Globalized Dual Heuristic Programming [J] . Liu D., Wang D., Zhao D., Automation Science and Engineering, IEEE Transactions on . 2012,第3期

机译：基于神经网络的一类未知离散非线性系统的全局最优启发式控制
4. Globally Optimal Training of Generalized Polynomial Neural Networks with Nonlinear Spectral Methods [C] . A. Gautier, Q. Nguyen, M. Hein Annual conference on Neural Information Processing Systems . 2016

机译：具有非线性光谱方法的全球多项式神经网络的全球最佳训练
5. Hybrid solution of stochastic optimal control problems using Gauss pseudospectral method and generalized polynomial chaos algorithms. [D] . Cottrill, Gerald C. 2012

机译：使用高斯伪谱方法和广义多项式混沌算法的混合随机最优控制问题求解。
6. The Use of Generalized Laguerre Polynomials in Spectral Methods for Solving Fractional Delay Differential Equations [O] . M. M. Khader -1

机译：广义Laguerre多项式在谱法中解分数阶时滞微分方程的应用。
7. The use of generalized Laguerre polynomials in spectral methods for nonlinear differential equations [O] . Khabibrakhmanov I.K., Summers D. 1998

机译：广义Laguerre多项式在非线性微分方程频谱方法中的使用
8. Hybrid Solution of Stochastic Optimal Control Problems Using Gauss Pseudospectral Method and Generalized Polynomial Chaos Algorithms [R] . Cottrill, G. C. 2012

机译：基于高斯伪谱法和广义多项式混沌算法的随机最优控制问题的混合解

Globally Optimal Training of Generalized Polynomial Neural Networks with Nonlinear Spectral Methods

摘要

著录项

相似文献

相关主题

期刊订阅