International Conference on Machine Learning

PA-GD: On the Convergence of Perturbed Alternating Gradient Descent to Second-Order Stationary Points for Structured Nonconvex Optimization



Abstract

Alternating gradient descent (A-GD) is a simple but popular algorithm in machine learning, which updates two blocks of variables in an alternating manner using gradient descent steps. In this paper, we consider a smooth unconstrained nonconvex optimization problem and propose a perturbed A-GD (PA-GD) that converges (with high probability) to second-order stationary points (SOSPs) at a global sublinear rate. Existing analyses of A-GD-type algorithms either guarantee convergence only to first-order solutions, or establish convergence to second-order solutions asymptotically (without rates). To the best of our knowledge, this is the first alternating-type algorithm that takes O(polylog(d)/ε^2) iterations to achieve an (ε, √ε)-SOSP with high probability, where polylog(d) denotes a polynomial in the logarithm of the problem dimension d.
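The sketch below illustrates, in Python, the perturbed alternating update the abstract describes: two blocks x and y are refreshed alternately with gradient steps, and a small random perturbation is injected whenever the joint gradient is small so the iterates can move off strict saddle points. The toy objective f(x, y) = 0.5||x||^2 - x·y + 0.25||y||^4, the step size eta, the threshold eps, and the perturbation radius are illustrative assumptions, not the paper's exact algorithm parameters.

import numpy as np

def grad_x(x, y):
    # Gradient w.r.t. x of the toy objective f(x, y) = 0.5*||x||^2 - x.dot(y) + 0.25*||y||^4
    return x - y

def grad_y(x, y):
    # Gradient w.r.t. y of the same toy objective
    return -x + (y @ y) * y

def pa_gd(x, y, eta=0.05, eps=1e-3, radius=1e-2, iters=5000, seed=0):
    rng = np.random.default_rng(seed)
    for _ in range(iters):
        gx, gy = grad_x(x, y), grad_y(x, y)
        # Perturbation step: a small joint gradient may indicate a saddle point, so add
        # small Gaussian noise (the paper perturbs within a ball; Gaussian is used here
        # only for simplicity).
        if np.sqrt(gx @ gx + gy @ gy) <= eps:
            x = x + radius * rng.standard_normal(x.shape)
            y = y + radius * rng.standard_normal(y.shape)
        # Alternating gradient steps: update x first, then update y using the new x.
        x = x - eta * grad_x(x, y)
        y = y - eta * grad_y(x, y)
    return x, y

# (0, 0) is a strict saddle of the toy objective; the perturbation lets PA-GD escape it
# and reach a second-order stationary point (here x = y with ||y|| = 1).
x, y = pa_gd(np.zeros(3), np.zeros(3))
print(x, y)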

