We consider the dynamics of gradient descent for learning a two-layer neural network. We assume the input $x\in\mathbb{R}^d$ is drawn from a Gaussian distribution and the label of $x$ satisfies $f^{\star}(x) = a^{\top}|W^{\star}x|$, where $a\in\mathbb{R}^d$ is a nonnegative vector and $W^{\star}\in\mathbb{R}^{d\times d}$ is an orthonormal matrix. We show that an \emph{over-parameterized} two-layer neural network with ReLU activation, trained by gradient descent from \emph{random initialization}, can provably learn the ground-truth network with population loss at most $o(1/d)$ in polynomial time with polynomially many samples. On the other hand, we prove that any kernel method, including the Neural Tangent Kernel, with a number of samples polynomial in $d$, has population loss at least $\Omega(1/d)$.
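The setting above can be sketched in a few lines of NumPy: sample Gaussian inputs, generate labels from the ground truth $f^{\star}(x) = a^{\top}|W^{\star}x|$ (with $a\ge 0$ and $W^{\star}$ orthonormal), and take one gradient-descent step on an over-parameterized two-layer ReLU student from random initialization. This is only an illustrative sketch of the data model and training dynamics, not the paper's algorithm or proof; the dimensions, width, and learning rate below are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(0)
d, m, n = 8, 64, 1000  # input dim, hidden width (over-parameterized, m > d), samples

# Ground truth: f*(x) = a^T |W* x|, a nonnegative, W* orthonormal (via QR).
a = np.abs(rng.standard_normal(d))
W_star, _ = np.linalg.qr(rng.standard_normal((d, d)))

X = rng.standard_normal((n, d))   # Gaussian inputs
y = np.abs(X @ W_star.T) @ a      # labels a^T |W* x| (entrywise absolute value)

# Student: f(x) = sum_j b_j * relu(w_j . x), random initialization.
W = rng.standard_normal((m, d)) / np.sqrt(d)
b = rng.standard_normal(m) / np.sqrt(m)

def empirical_loss(W, b):
    pred = np.maximum(X @ W.T, 0.0) @ b
    return 0.5 * np.mean((pred - y) ** 2)

# One gradient-descent step on the empirical squared loss.
lr = 0.005
pre = X @ W.T                  # (n, m) pre-activations
h = np.maximum(pre, 0.0)       # ReLU features
err = h @ b - y                # residuals
grad_b = h.T @ err / n
grad_W = ((err[:, None] * b) * (pre > 0)).T @ X / n
loss_before = empirical_loss(W, b)
b -= lr * grad_b
W -= lr * grad_W
loss_after = empirical_loss(W, b)
print(loss_after < loss_before)  # one small GD step decreases the loss
```

Note that $|W^{\star}x| = \mathrm{ReLU}(W^{\star}x) + \mathrm{ReLU}(-W^{\star}x)$, so the target is itself representable by a two-layer ReLU network of width $2d$; the student is over-parameterized whenever $m > 2d$.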