Neurocomputing

SGD-rα: A real-time α-suffix averaging method for SGD with biased gradient estimates

Abstract

Stochastic gradient descent (SGD) is a simple and efficient method for solving large-scale stochastic optimization problems in machine learning. It has been shown that the convergence rate of SGD can be improved by the α-suffix averaging technique, abbreviated as SGD-α. Classic analyses usually assume unbiased gradient estimates, an assumption that fails in many practical applications of SGD-α, such as non-independently and identically distributed (non-i.i.d.) scenarios. Another limitation is that SGD-α needs to store all iterates in memory and thus cannot be implemented on-the-fly. To address these issues, we employ a rounding technique to propose a real-time version of SGD-α (named SGD-rα), which computes the α-suffix average iteratively and retains the convergence rate of SGD-α. In particular, SGD-rα with biased gradient estimates attains a sublinear convergence rate for strongly convex objectives. Numerical experiments on benchmark datasets illustrate the behavior of SGD-rα and corroborate the theoretical results. The implementation of SGD-rα is available at: https://github.com/xudp100/SGD-ra. (C) 2022 Elsevier B.V. All rights reserved.
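
For context, the α-suffix average of SGD iterates x_1, ..., x_T is the mean of the last ⌈αT⌉ iterates, x̄ = (1/⌈αT⌉) Σ_{t = T−⌈αT⌉+1}^{T} x_t. The Python sketch below illustrates how this average can be maintained with a single running mean under the simplifying assumption that the horizon T is fixed in advance; this stands in for the paper's rounding technique, whose details the abstract does not give (see the linked repository for the authors' actual SGD-rα). The function name sgd_suffix_avg, the grad_fn callback, and the 1/t step size for the strongly convex case are illustrative choices, not the paper's API.

    import numpy as np

    def sgd_suffix_avg(grad_fn, x0, T, alpha=0.5, lr0=1.0):
        # Plain SGD whose alpha-suffix average is maintained incrementally.
        # Simplification (not the paper's rounding technique): T is known
        # up front, so averaging simply starts at iterate T - ceil(alpha*T).
        x = np.asarray(x0, dtype=float)
        start = T - int(np.ceil(alpha * T))   # first index inside the suffix
        x_bar, n = np.zeros_like(x), 0
        for t in range(T):
            g = grad_fn(x, t)                 # possibly biased gradient estimate
            x = x - (lr0 / (t + 1)) * g       # 1/t step size (strongly convex case)
            if t >= start:                    # fold iterate into the running mean
                n += 1
                x_bar += (x - x_bar) / n
        return x_bar

    # Toy check: strongly convex quadratic f(x) = ||x - 3||^2 with noisy,
    # slightly biased gradients; the suffix average lands near the optimum.
    rng = np.random.default_rng(0)
    grad = lambda x, t: 2.0 * (x - 3.0) + rng.normal(loc=0.01, scale=0.5, size=x.shape)
    print(sgd_suffix_avg(grad, x0=np.zeros(2), T=50000, alpha=0.5))

Because only the current iterate x and the running mean x_bar are stored, the memory footprint is constant in T, which is the on-the-fly property the abstract highlights; the deliberately nonzero noise mean (0.01) mimics the biased gradient estimates the paper analyzes.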
