International Conference on Machine Learning

The Implicit Regularization of Stochastic Gradient Flow for Least Squares



Abstract

We study the implicit regularization of mini-batch stochastic gradient descent, when applied to the fundamental problem of least squares regression. We leverage a continuous-time stochastic differential equation having the same moments as stochastic gradient descent, which we call stochastic gradient flow. We give a bound on the excess risk of stochastic gradient flow at time t, over ridge regression with tuning parameter λ = 1/t. The bound may be computed from explicit constants (e.g., the mini-batch size, step size, number of iterations), revealing precisely how these quantities drive the excess risk. Numerical examples show the bound can be small, indicating a tight relationship between the two estimators. We give a similar result relating the coefficients of stochastic gradient flow and ridge. These results hold under no conditions on the data matrix X, and across the entire optimization path (not just at convergence).
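To make the stated correspondence concrete, here is a minimal numerical sketch (not the authors' code): it runs mini-batch SGD on a synthetic least squares problem and, at elapsed continuous time t = k·ε (iteration count times step size), compares the iterate against the ridge solution with tuning parameter λ = 1/t. The problem dimensions, step size, batch size, and loss scaling by 1/n are all illustrative assumptions, not values taken from the paper.

```python
# Illustrative sketch: mini-batch SGD on least squares vs. ridge at lambda = 1/t.
# All constants (n, p, eps, m) are arbitrary choices for demonstration.
import numpy as np

rng = np.random.default_rng(0)
n, p = 200, 20
X = rng.standard_normal((n, p))
beta_true = rng.standard_normal(p)
y = X @ beta_true + rng.standard_normal(n)

eps = 0.01        # step size
m = 10            # mini-batch size
beta = np.zeros(p)  # start at the origin, as in gradient flow

for k in range(1, 2001):
    # one mini-batch stochastic gradient step on f(beta) = ||y - X beta||^2 / (2n)
    idx = rng.choice(n, size=m, replace=False)
    Xb, yb = X[idx], y[idx]
    beta -= eps * Xb.T @ (Xb @ beta - yb) / m

    if k % 500 == 0:
        t = k * eps                   # elapsed continuous time
        lam = 1.0 / t                 # matched ridge tuning parameter
        beta_ridge = np.linalg.solve(X.T @ X / n + lam * np.eye(p),
                                     X.T @ y / n)
        print(f"t = {t:6.2f}   ||beta_sgd - beta_ridge|| = "
              f"{np.linalg.norm(beta - beta_ridge):.4f}")
```

Under this setup the printed distances tend to stay small along the whole path, which is the qualitative behavior the abstract describes; the paper's actual bound is stated in terms of the mini-batch size, step size, and number of iterations.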
