...
首页> 外文期刊>ESAIM >RATE OF CONVERGENCE OF THE NESTEROV ACCELERATED GRADIENT METHOD IN THE SUBCRITICAL CASE α≤3
【24h】

RATE OF CONVERGENCE OF THE NESTEROV ACCELERATED GRADIENT METHOD IN THE SUBCRITICAL CASE α≤3

机译:亚临界情况下Nesterov加速梯度方法的收敛速度

获取原文
获取原文并翻译 | 示例
           

摘要

In a Hilbert space setting H, given Phi : H -> R a convex continuously differentiable function, and alpha a positive parameter, we consider the inertial dynamic system with Asymptotic Vanishing Damping(AVD)(alpha) (sic)(t) + alpha/t(x) over dot (t) + del Phi(x(t)) = 0.Depending on the value of alpha with respect to 3, we give a complete picture of the convergence properties as t -> +infinity of the trajectories generated by (AVD)(alpha), as well as iterations of the corresponding algorithms. Indeed, as shown by Su-Boyd-Candes, the case alpha = 3 corresponds to a continuous version of the accelerated gradient method of Nesterov, with the rate of convergence Phi(x(t)) - min Phi = O(t(-2)) for alpha >= 3. Our main result concerns the subcritical case alpha <= 3, where we show that Phi(x(t)) - min Phi = O (t(-2/3 alpha)). This overall picture shows a continuous variation of the rate of convergence of the values Phi(x(t)) - min(H) Phi = O (t(-p(alpha))) with respect to alpha > 0: the coefficient p(alpha) increases linearly up to 2 when alpha goes from 0 to 3, then displays a plateau. Then we examine the convergence of trajectories to optimal solutions. As a new result, in the one-dimensional framework, for the critical value alpha = 3, we prove the convergence of the trajectories. In the second part of this paper, we study the convergence properties of the associated forward-backward inertial algorithms. They aim to solve structured convex minimization problems of the form min {Theta := Phi + Psi}, with Phi smooth and Psi nonsmooth. The continuous dynamics serves as a guideline for this study. We obtain a similar rate of convergence for the sequence of iterates (x(k)): for alpha <= 3 we have Theta(x(k)) - min Theta = O(k(-p)) for all p < 2 alpha/3, and for alpha > 3 Theta(x(k)) - min Theta = o(k(-2)). Finally, we show that the results are robust with respect to external perturbations.
机译:在Hilbert空间设置H中,给定Phi:H-> R是凸连续可微函数,并且alpha是正参数,我们考虑具有渐近消失阻尼(AVD)(alpha)(sic)(t)+ alpha的惯性动力系统/ t(x)在点(t)+ del Phi(x(t))= 0上。根据相对于3的alpha值,我们给出了收敛特性的完整图,如t-> + (AVD)α生成的轨迹,以及相应算法的迭代。实际上,如Su-Boyd-Candes所示,α= 3的情况对应于Nesterov加速梯度方法的连续版本,收敛速度为Phi(x(t))-min Phi = O(t(- 2))对于alpha> =3。我们的主要结果涉及亚临界情况alpha <= 3,其中我们证明Phi(x(t))-min Phi = O(t(-2/3 alpha))。总体图片显示了相对于α> 0的值Phi(x(t))-min(H)Phi = O(t(-p(alpha)))的收敛速度的连续变化:系数p当alpha从0变为3时,α线性增加至2,然后显示平稳状态。然后,我们研究了轨迹到最优解的收敛性。作为一个新的结果,在一维框架中,对于临界值alpha = 3,我们证明了轨迹的收敛性。在本文的第二部分,我们研究了相关的向前-向后惯性算法的收敛性。他们旨在解决形式为{{Theta:= Phi + Psi}的结构化凸极小化问题,其中Phi光滑且Psi不光滑。连续动力学是本研究的指导。对于迭代(x(k))的序列,我们获得相似的收敛速度:对于alpha <= 3,对于所有p <2,我们都有Theta(x(k))-min Theta = O(k(-p)) alpha / 3,并且对于alpha> 3 Theta(x(k))-min Theta = o(k(-2))。最后,我们证明了相对于外部扰动而言,结果是可靠的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号