...
首页> 外文期刊>Journal of Scientific Computing >The Chaotic Nature of Faster Gradient Descent Methods
【24h】

The Chaotic Nature of Faster Gradient Descent Methods

机译:快速梯度下降方法的混沌性质

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The steepest descent method for large linear systems is well-known to often converge very slowly, with the number of iterations required being about the same as that obtained by utilizing a gradient descent method with the best constant step size and growing proportionally to the condition number. Faster gradient descent methods must occasionally resort to significantly larger step sizes, which in turn yields a rather non-monotone decrease pattern in the residual vector norm. We show that such faster gradient descent methods in fact generate chaotic dynamical systems for the normalized residual vectors. Very little is required to generate chaos here: simply damping steepest descent by a constant factor close to 1 will do. Several variants of the family of faster gradient descent methods are investigated, both experimentally and analytically. The fastest practical methods of this family in general appear to be the known, chaotic, two-step ones. Our results also highlight the need of better theory for existing faster gradient descent methods.
机译:众所周知,大型线性系统的最速下降法通常收敛很慢,所需的迭代次数与利用具有最佳恒定步长并与条件数成比例增长的梯度下降法获得的迭代次数大致相同。 。较快的梯度下降方法有时必须求助于较大的步长,从而在残差矢量范数中产生相当非单调的下降模式。我们表明,这种更快的梯度下降方法实际上为归一化的残差矢量生成了混沌动力学系统。在这里产生混乱的要求很小:只需将最陡峭的下降阻尼到接近1的常数即可。通过实验和分析,研究了快速梯度下降方法家族的几种变体。通常,该家族中最快的实用方法似乎是已知的,混乱的两步法。我们的结果还强调了对现有更快的梯度下降方法需要更好的理论的需求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号