IEEE Conference on Computer Vision and Pattern Recognition

Global Optimality in Neural Network Training

Abstract

The past few years have seen a dramatic increase in the performance of recognition systems thanks to the introduction of deep networks for representation learning. However, the mathematical reasons for this success remain elusive. A key issue is that the neural network training problem is nonconvex, hence optimization algorithms may not return a global minimum. This paper provides sufficient conditions to guarantee that local minima are globally optimal and that a local descent strategy can reach a global minimum from any initialization. Our conditions require both the network output and the regularization to be positively homogeneous functions of the network parameters, with the regularization designed to control the network size. Our results apply to networks with one hidden layer, where size is measured by the number of neurons in the hidden layer, and to networks with multiple deep subnetworks connected in parallel, where size is measured by the number of subnetworks.
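To make the homogeneity condition concrete, the following is a minimal NumPy sketch (illustrative, not taken from the paper): for a one-hidden-layer ReLU network f(x; W1, W2) = W2 relu(W1 x), scaling every parameter by a factor alpha > 0 scales the output by alpha^2, so the output is a positively homogeneous function of degree 2 in the network parameters. The abstract's conditions ask the regularizer to be positively homogeneous in these same parameters as well.

import numpy as np

def relu(x):
    return np.maximum(0.0, x)

def one_hidden_layer(x, W1, W2):
    # f(x; W1, W2) = W2 @ relu(W1 @ x): a one-hidden-layer ReLU network
    return W2 @ relu(W1 @ x)

rng = np.random.default_rng(0)
x = rng.standard_normal(5)
W1 = rng.standard_normal((8, 5))   # hidden-layer weights (8 neurons)
W2 = rng.standard_normal((3, 8))   # output-layer weights

alpha = 2.5
# relu(alpha * z) = alpha * relu(z) for alpha > 0, so scaling both weight
# matrices by alpha scales the output by alpha**2 (degree-2 homogeneity).
lhs = one_hidden_layer(x, alpha * W1, alpha * W2)
rhs = alpha**2 * one_hidden_layer(x, W1, W2)
print(np.allclose(lhs, rhs))  # True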