首页> 外文期刊>JMLR: Workshop and Conference Proceedings >On Constrained Nonconvex Stochastic Optimization: A Case Study for Generalized Eigenvalue Decomposition
【24h】

On Constrained Nonconvex Stochastic Optimization: A Case Study for Generalized Eigenvalue Decomposition

机译:约束非凸随机优化研究:以广义特征值分解为例

获取原文
获取外文期刊封面目录资料

摘要

We study constrained nonconvex optimization problems in machine learning and signal processing. It is well-known that these problems can be rewritten to a min-max problem in a Lagrangian form. However, due to the lack of convexity, their landscape is not well understood and how to find the stable equilibria of the Lagrangian function is still unknown. To bridge the gap, we study the landscape of the Lagrangian function. Further, we define a special class of Lagrangian functions. They enjoy the following two properties: 1. Equilibria are either stable or unstable (Formal definition in Section 2); 2.Stable equilibria correspond to the global optima of the original problem. We show that a generalized eigenvalue (GEV) problem, including canonical correlation analysis and other problems as special examples, belongs to the class. Specifically, we characterize its stable and unstable equilibria by leveraging an invariant group and symmetric property (more details in Section 3). Motivated by these neat geometric structures, we propose a simple, efficient, and stochastic primal-dual algorithm solving the online GEV problem. Theoretically, under sufficient conditions, we establish an asymptotic rate of convergence and obtain the first sample complexity result for the online GEV problem by diffusion approximations, which are widely used in applied probability. Numerical results are also provided to support our theory.
机译:我们研究机器学习和信号处理中的约束非凸优化问题。众所周知,这些问题可以以拉格朗日形式重写为最小-最大问题。然而,由于缺乏凸性,它们的面貌还没有得到很好的理解,如何找到拉格朗日函数的稳定平衡点仍然未知。为了弥合差距,我们研究了拉格朗日函数的景观。此外,我们定义了一类特殊的拉格朗日函数。它们具有以下两个属性:1.平衡是稳定的或不稳定的(第2节中的正式定义); 2.稳定的均衡对应于原始问题的全局最优。我们表明,广义特征值(GEV)问题,包括典范相关分析和其他作为特殊示例的问题,都属于该类。具体来说,我们通过利用不变基团和对称性质来描述其稳定和不稳定的平衡(更多详细信息请参见第3节)。基于这些整洁的几何结构,我们提出了一种简单,有效且随机的原始对偶算法来解决在线GEV问题。从理论上讲,在足够的条件下,我们建立渐近收敛速度,并通过扩散近似获得在线GEV问题的第一个样本复杂度结果,该结果在应用概率中得到了广泛应用。还提供了数值结果以支持我们的理论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号