JMLR: Workshop and Conference Proceedings

Approximation Schemes for ReLU Regression


Abstract

We consider the fundamental problem of ReLU regression, where the goal is to output the best-fitting ReLU with respect to square loss given access to draws from some unknown distribution. We give the first efficient, constant-factor approximation algorithm for this problem assuming the underlying distribution satisfies some weak concentration and anti-concentration conditions (and includes, for example, all log-concave distributions). This solves the main open problem of Goel et al., who proved hardness results for any exact algorithm for ReLU regression (up to an additive $\epsilon$). Using more sophisticated techniques, we can improve our results and obtain a polynomial-time approximation scheme for any subgaussian distribution. Given the aforementioned hardness results, these guarantees cannot be substantially improved. Our main insight is a new characterization of \emph{surrogate losses} for nonconvex activations. While prior work had established the existence of convex surrogates for monotone activations, we show that properties of the underlying distribution actually induce strong convexity for the loss, allowing us to relate the global minimum to the activation's \emph{Chow parameters}.
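
The sketch below is only an illustration of the surrogate-loss idea the abstract refers to, not the authors' algorithm or guarantees: it replaces the nonconvex square loss E[(ReLU(w·x) − y)^2] with the classical convex surrogate ℓ(w; x, y) = ReLU(w·x)^2 / 2 − y·(w·x), whose gradient is (ReLU(w·x) − y)·x, so a stationary point matches the empirical Chow parameters E[y·x]. The synthetic Gaussian data, hidden direction w_star, step size, and iteration count are all assumptions made for the example.

```python
# Minimal sketch of ReLU regression via a convex surrogate loss.
# Illustrative only; not the algorithm or analysis from the paper.
import numpy as np

rng = np.random.default_rng(0)
d, n = 10, 20000

# Synthetic data: standard Gaussian x (log-concave, as the abstract's
# conditions allow) and noisy ReLU labels from a hidden direction w_star.
w_star = rng.normal(size=d)
w_star /= np.linalg.norm(w_star)
X = rng.normal(size=(n, d))
y = np.maximum(X @ w_star, 0.0) + 0.1 * rng.normal(size=n)

def surrogate_loss(w):
    # ℓ(w) = mean( ReLU(w·x)^2 / 2 - y * (w·x) ); convex in w.
    t = X @ w
    return np.mean(0.5 * np.maximum(t, 0.0) ** 2 - y * t)

def surrogate_grad(w):
    # Gradient (ReLU(w·x) - y) x; vanishes when the model's Chow
    # parameters E[ReLU(w·x) x] match the empirical E[y x].
    t = X @ w
    return X.T @ (np.maximum(t, 0.0) - y) / n

def square_loss(w):
    # The original (nonconvex) objective we actually care about.
    return np.mean((np.maximum(X @ w, 0.0) - y) ** 2)

# Plain gradient descent on the convex surrogate.
w = np.zeros(d)
for _ in range(500):
    w -= 0.5 * surrogate_grad(w)

print("surrogate loss:", surrogate_loss(w))
print("square loss   :", square_loss(w))
print("angle to w_star (deg):",
      np.degrees(np.arccos(np.clip(w @ w_star / np.linalg.norm(w), -1, 1))))
```

On distributions like the Gaussian above, the surrogate is strongly convex in expectation, which is the property the abstract says lets the global minimizer be related to the activation's Chow parameters.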
