首页> 外文会议>Annual conference on Neural Information Processing Systems >Globally Optimal Training of Generalized Polynomial Neural Networks with Nonlinear Spectral Methods
【24h】

Globally Optimal Training of Generalized Polynomial Neural Networks with Nonlinear Spectral Methods

机译:非线性谱方法的广义多项式神经网络的全局最优训练

获取原文

摘要

The optimization problem behind neural networks is highly non-convex. Training with stochastic gradient descent and variants requires careful parameter tuning and provides no guarantee to achieve the global optimum. In contrast we show under quite weak assumptions on the data that a particular class of feedforward neural networks can be trained globally optimal with a linear convergence rate with our nonlinear spectral method. Up to our knowledge this is the first practically feasible method which achieves such a guarantee. While the method can in principle be applied to deep networks, we restrict ourselves for simplicity in this paper to one and two hidden layer networks. Our experiments confirm that these models are rich enough to achieve good performance on a series of real-world datasets.
机译:神经网络背后的优化问题是高度非凸的。随机梯度下降和变化形式的训练需要仔细的参数调整,不能保证达到全局最优。相比之下,我们在非常弱的数据假设下表明,可以使用非线性谱方法以线性收敛速率对一类特定的前馈神经网络进行全局最优训练。据我们所知,这是第一个获得这种保证的切实可行的方法。虽然该方法原则上可以应用于深层网络,但为简单起见,我们在本文中将自身限制为一个和两个隐藏层网络。我们的实验证实,这些模型足够丰富,可以在一系列实际数据集中实现良好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号