Information and Inference

Robust and resource efficient identification of shallow neural networks by fewest samples



Abstract

We address the structure identification and the uniform approximation of sums of ridge functions f(x) = Σ_{i=1}^m g_i(⟨a_i, x⟩) on R^d, representing a general form of a shallow feed-forward neural network, from a small number of query samples. Higher-order differentiation, as used in our constructive approximations, of sums of ridge functions or of their compositions, as in deeper neural networks, yields a natural connection between neural network weight identification and tensor product decomposition identification. In the case of the shallowest feed-forward neural network, second-order differentiation and tensors of order two (i.e., matrices) suffice, as we prove in this paper. We use two sampling schemes to perform approximate differentiation: active sampling, where the sampling points are universal and actively and randomly designed, and passive sampling, where the sampling points are preselected at random from a distribution with known density. Based on multiple gathered approximate first- and second-order differentials, our general approximation strategy is developed as a sequence of algorithms performing individual sub-tasks. We first perform an active subspace search by approximating the span of the weight vectors a_1, …, a_m. Then we use a straightforward substitution, which reduces the dimensionality of the problem from d to m. The core of the construction is then the stable and efficient approximation of the weights, expressed in terms of the rank-1 matrices a_i ⊗ a_i, realized by formulating their individual identification as a suitable nonlinear program. We prove the successful identification by this program of weight vectors that are close to orthonormal, and we also show how to constructively reduce to this case by a whitening procedure, without loss of generality. We finally discuss the implementation and the performance of the proposed algorithmic pipeline with extensive numerical experiments, which illustrate and confirm the theoretical results.
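The link between second-order differentiation and rank-1 matrices that the abstract describes can be illustrated with a minimal NumPy sketch (the weights, the tanh ridge profiles and all parameter values below are illustrative assumptions, not the paper's construction): for f(x) = Σ_i g_i(⟨a_i, x⟩), the Hessian ∇²f(x) = Σ_i g_i″(⟨a_i, x⟩) a_i ⊗ a_i is a combination of m rank-1 matrices, so a finite-difference Hessian has numerical rank at most m, far below the ambient dimension d.

```python
import numpy as np

rng = np.random.default_rng(0)
d, m = 10, 3

# Hypothetical network: weights a_i and ridge profiles g_i = tanh are
# illustrative choices, not taken from the paper.
A = rng.standard_normal((m, d)) / np.sqrt(d)   # rows are the weight vectors a_i

def f(x):
    # f(x) = sum_i g_i(<a_i, x>) with g_i = tanh
    return np.sum(np.tanh(A @ x))

def hessian_fd(func, x, h=1e-4):
    """Approximate the Hessian of func at x by central finite differences."""
    n = x.size
    H = np.zeros((n, n))
    I = np.eye(n)
    for i in range(n):
        for j in range(n):
            H[i, j] = (func(x + h*I[i] + h*I[j]) - func(x + h*I[i] - h*I[j])
                       - func(x - h*I[i] + h*I[j]) + func(x - h*I[i] - h*I[j])) / (4*h*h)
    return H

x0 = rng.standard_normal(d)
H = hessian_fd(f, x0)

# Analytically, H = sum_i g_i''(<a_i, x0>) a_i a_i^T, a combination of the m
# rank-1 matrices a_i (x) a_i, so its numerical rank is at most m = 3.
s = np.linalg.svd(H, compute_uv=False)
print("numerical rank:", int(np.sum(s > 1e-5)))
```

Sampling such Hessians at several points x0 and collecting their spans is what lets the active subspace search recover span{a_1, …, a_m} from few queries.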
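The whitening step mentioned above can also be sketched in a few lines (again a hypothetical illustration with made-up data, not the paper's procedure in full generality): in the exact, noiseless case of m weight vectors in the reduced m-dimensional space, transforming by G^{-1/2}, where G = Σ_i a_i a_i^T, yields exactly orthonormal weights.

```python
import numpy as np

rng = np.random.default_rng(1)
m = 4

# Hypothetical weight vectors in the reduced m-dimensional space (after the
# dimension reduction from d to m); illustrative data only.
B = rng.standard_normal((m, m))        # columns are the weight vectors a_i

# Whitening: with G = sum_i a_i a_i^T known, transform each a_i by G^{-1/2}.
G = B @ B.T
w, U = np.linalg.eigh(G)
W = U @ np.diag(w ** -0.5) @ U.T       # symmetric inverse square root of G
Bw = W @ B                             # whitened weights as columns

# In this noiseless square case the whitened weights are exactly orthonormal:
# (W B)(W B)^T = G^{-1/2} G G^{-1/2} = I, and a square matrix with
# orthonormal rows has orthonormal columns as well.
print(np.allclose(Bw @ Bw.T, np.eye(m)))   # True
```

This is why it suffices to prove identification for near-orthonormal weights: the general case reduces to it without loss of generality.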
