Tight Sample Complexity of Learning One-hidden-layer Convolutional Neural Networks

Abstract

We study the sample complexity of learning one-hidden-layer convolutional neural networks (CNNs) with non-overlapping filters. We propose a novel algorithm called approximate gradient descent for training CNNs, and show that, with high probability, the proposed algorithm with random initialization converges linearly to the ground-truth parameters up to statistical precision. Compared with existing work, our result applies to general non-trivial, monotonic, and Lipschitz continuous activation functions, including ReLU, Leaky ReLU, Sigmoid, and Softplus. Moreover, our sample complexity improves on existing results in its dependence on the number of hidden nodes and the filter size. In fact, our result matches the information-theoretic lower bound for learning one-hidden-layer CNNs with linear activation functions, suggesting that our sample complexity is tight. Our theoretical analysis is backed up by numerical experiments.
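
The model class named in the abstract can be illustrated with a short sketch. It assumes a common formulation of a one-hidden-layer CNN with non-overlapping filters: the input is split into k disjoint patches of length r, one shared filter w is applied to every patch, and the activations are combined by output weights v. The function names, the Gaussian inputs, and the noise level are assumptions made for illustration and are not taken from the paper; the approximate gradient descent algorithm itself is not reproduced here.

```python
import random


def relu(z):
    """ReLU activation, one of the activations listed in the abstract."""
    return max(0.0, z)


def forward(x, w, v, activation=relu):
    """Output of a one-hidden-layer CNN with non-overlapping patches.

    x: input of length k * r, read as k consecutive patches of length r
    w: shared filter of length r
    v: output weights of length k (one per hidden node)
    """
    r, k = len(w), len(v)
    if len(x) != k * r:
        raise ValueError("len(x) must equal len(v) * len(w)")
    out = 0.0
    for j in range(k):
        patch = x[j * r:(j + 1) * r]  # patches do not overlap
        pre = sum(wi * xi for wi, xi in zip(w, patch))
        out += v[j] * activation(pre)
    return out


def sample_dataset(n, w_star, v_star, noise_std=0.1, seed=0):
    """Draw n noisy samples from a ground-truth network (w_star, v_star).

    Assumption for illustration: standard Gaussian inputs and additive
    Gaussian label noise.
    """
    rng = random.Random(seed)
    d = len(w_star) * len(v_star)
    data = []
    for _ in range(n):
        x = [rng.gauss(0.0, 1.0) for _ in range(d)]
        y = forward(x, w_star, v_star) + rng.gauss(0.0, noise_std)
        data.append((x, y))
    return data


# Two patches [1, 2] and [-3, 4], filter [1, 1], output weights [2, -1]:
# 2 * relu(3) + (-1) * relu(1) = 5
print(forward([1, 2, -3, 4], [1.0, 1.0], [2.0, -1.0]))  # 5.0
```

In this setting, the sample complexity is the number n of such (x, y) pairs needed to recover w_star and v_star to a given accuracy.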

Bibliographic record

Source
Conference on Neural Information Processing Systems | 2020 | pp. 10355-11148 | 11 pages
Authors
Yuan Cao; Quanquan Gu
Original format: PDF
Chinese Library Classification: Metrology

