Conference on Neural Information Processing Systems

Is Deeper Better only when Shallow is Good?


Abstract

Understanding the power of depth in feed-forward neural networks is an ongoing challenge in the field of deep learning theory. While current works account for the importance of depth for the expressive power of neural networks, it remains an open question whether these benefits are exploited during a gradient-based optimization process. In this work, we explore the relation between the expressivity properties of deep networks and the ability to train them efficiently using gradient-based algorithms. We give a depth-separation argument for distributions with fractal structure, showing that they can be expressed efficiently by deep networks, but not by shallow ones. These distributions have a natural coarse-to-fine structure, and we show that the balance between the coarse and fine details has a crucial effect on whether the optimization process is likely to succeed. We prove that when the distribution is concentrated on the fine details, gradient-based algorithms are likely to fail. Using this result, we prove that, at least for some distributions, the success of learning deep networks depends on whether the distribution can be approximated by shallower networks, and we conjecture that this property holds in general.
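To make the coarse-to-fine picture concrete, the following sketch is a hypothetical stand-in for the setting the abstract describes, not the paper's actual construction: it builds a one-dimensional target by iterating a tent map (so each iteration adds finer detail) and fits it with a shallow and a deep ReLU network using a gradient-based optimizer. The function names, network widths, depths, and hyperparameters are all illustrative assumptions.

# Illustrative sketch only (not the paper's construction): a coarse-to-fine
# 1D labeling built from an iterated tent map, fit by a shallow vs. a deep
# ReLU network with a gradient-based optimizer. All choices are arbitrary.
import torch
import torch.nn as nn

torch.manual_seed(0)

def tent(x):
    # One refinement step: the classic tent map on [0, 1].
    return 1.0 - 2.0 * (x - 0.5).abs()

def labels(x, depth):
    # Compose the tent map `depth` times; thresholding the result yields a
    # target whose oscillations (the "fine details") multiply with depth.
    y = x
    for _ in range(depth):
        y = tent(y)
    return (y > 0.5).float()

def mlp(hidden_layers, width):
    # Plain fully connected ReLU network with scalar input and output.
    layers, d_in = [], 1
    for _ in range(hidden_layers):
        layers += [nn.Linear(d_in, width), nn.ReLU()]
        d_in = width
    layers += [nn.Linear(d_in, 1)]
    return nn.Sequential(*layers)

x = torch.rand(4096, 1)
y = labels(x, depth=4)  # larger depth -> target dominated by fine details

for name, net in [("shallow", mlp(1, 256)), ("deep", mlp(6, 32))]:
    opt = torch.optim.Adam(net.parameters(), lr=1e-2)
    loss_fn = nn.BCEWithLogitsLoss()
    for step in range(2000):
        opt.zero_grad()
        loss = loss_fn(net(x), y)
        loss.backward()
        opt.step()
    acc = ((net(x) > 0).float() == y).float().mean().item()
    print(f"{name}: train accuracy {acc:.3f}")

Raising the `depth` argument of `labels` shifts the target's mass toward finer details; under the abstract's thesis, one would expect gradient-based training to degrade in that regime even for the deep network, which is the kind of behavior this toy setup lets one probe.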
