...
首页> 外文期刊>Signal processing >Accelerated stochastic gradient descent with step size selection rules
【24h】

Accelerated stochastic gradient descent with step size selection rules

机译:具有步长选择规则的加速随机梯度下降

获取原文
获取原文并翻译 | 示例

摘要

Accelerated stochastic gradient descent (ASGD) methods, which incorporate accelerated proximal gradient (APG) and stochastic gradient (SG), have received considerable attention recently for solving regularized risk minimization problems in signal/image processing, statistics and machine learning. However, there has been a paucity of practical guidance proposed for resolving one of the major issues in ASGD: how to choose an appropriate step size. To solve this problem, we propose to use the Barzilai-Borwein (BB) method to automatically compute step size for the accelerated mini-batch Prox-SVRG (Acc-Prox-SVRG) method (the state of the art ASGD method), thereby obtaining a new accelerated method: Acc-Prox-SVRGBB. We prove the convergence of Acc-Prox-SVRG-BB and show that its complexity is comparable with the best known stochastic gradient methods. In addition, we incorporate Beck and Teboulle's APG (FISTA) and Prox-SVRG in a mini-batch setting and obtain another new accelerated gradient descent method, FISTA-Prox-SVRG, which requires the selection of fewer unknown parameters than those required in Acc-Prox-SVRG. Finally, we introduce the BB method into FISTA-Prox-SVRG to further show the efficacy of the BB method. Numerical results demonstrate the advantage of our algorithms. (C) 2019 Elsevier B.V. All rights reserved.
机译:加速随机梯度下降(ASGD)方法结合了加速近端梯度(APG)和随机梯度(SG),最近在解决信号/图像处理,统计和机器学习中的正则化风险最小化问题方面引起了极大关注。但是,为解决ASGD中的主要问题之一,建议的实用指南很少:如何选择合适的步长。为了解决此问题,我们建议使用Barzilai-Borwein(BB)方法来自动计算加速小批量Prox-SVRG(Acc-Prox-SVRG)方法(最新的ASGD方法)的步长,从而获得一种新的加速方法:Acc-Prox-SVRGBB。我们证明了Acc-Prox-SVRG-BB的收敛性,并证明了其复杂性可与最著名的随机梯度法相比。此外,我们将Beck and Teboulle的APG(FISTA)和Prox-SVRG纳入了一个小批量设置中,并获得了另一种新的加速梯度下降方法FISTA-Prox-SVRG,该方法所需的未知参数比Acc中所需的更少-Prox-SVRG。最后,我们将BB方法引入FISTA-Prox-SVRG,以进一步证明BB方法的有效性。数值结果证明了我们算法的优势。 (C)2019 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Signal processing 》 |2019年第6期| 171-186| 共16页
  • 作者单位

    Xiamen Univ, Sch Informat Sci & Engn, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen 361005, FJ, Peoples R China|Sun Yat Sen Univ, Sch Elect & Commun Engn, Guangzhou 510275, Guangdong, Peoples R China;

    Xiamen Univ, Sch Informat Sci & Engn, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen 361005, FJ, Peoples R China;

    Xiamen Univ, Sch Informat Sci & Engn, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen 361005, FJ, Peoples R China;

    Xiamen Univ, Sch Informat Sci & Engn, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen 361005, FJ, Peoples R China|Univ Waterloo, Dept Geog & Environm Management, Waterloo, ON N2L 3G1, Canada;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Empirical risk minimization; Accelerated proximal gradient method; Mini-batches; Barzilai-Borwein method; Variance reduction; Convex optimization;

    机译:经验风险最小化;加速近端梯度法;小批量;Barzilai-Borwein方法;方差降低;凸优化;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号