【24h】

Asynchronous Block Coordinate Descent Method for Large-Scale Nonconvex Problem in Real World Study

机译:实际研究中的大规模非凸问题的异步块坐标下降法

获取原文
获取外文期刊封面目录资料

摘要

Here comes the era of big data in healthcare, and this era will transform medicine and especially oncology. In the last decade, there has been a growing interest in the potential benefits and the relevance of real world studies (RWS) with to the rapid development of data mining algorithms, as well as large-scale accumulated real world datasets. However, it is beyond the ability of traditional data mining algorithms to solve such large-scale RWS problems due to the large amount of data need to be processed and the difficulty in the problems solving, especially the nonconvex optimization problems. In many cases for these nonconvex problems, the goal is to find a reasonable local minimum, and the main concern is that the iterative updates are easily trapped in saddle points or bad local optima points. In this paper, we aim at minimizing the sum of a smooth nonlinear function and a block-separable nonconvex function involving a large number of block variables subjected to inequality constraints, which covers a wide range of interesting applications appearing in distributed optimization, statistics learning, etc. However, solving such a nonconvex nonlinear optimization problem associated with inequality constraints remains as a challenging task. This is because: 1) it is still an open problem about how to escape from saddle points and bad local optima points using a block coordinate descent method to deal with the nonconvex nonlinear optimization functions; 2) it can be extremely expensive to compute the entire block variables when the problem scale is large. Therefore, we propose a novel parallel first-order optimization method, called as Asynchronous block coordinate descent with Time Perturbation (ATP), which utilizing the time perturbation technique that escapes from saddle points and local optima points. We present a randomized block variable selection rule of the proposed method, as well as the iteration complexity. Experiments conducted on typical machine learning problems in RWS, i.e., folded concave linear regression, validate the efficacy of our optimization method. In particular, the experiment results demonstrate that time perturbation could greatly help the proposed algorithm escape from saddle points and local optima in the nonconvex part, which provides a promising way to tackle nonconvex optimization problems employing block coordinate descent.
机译:医疗保健大数据时代来了,这个时代将改变医学,尤其是肿瘤学。在过去的十年中,人们越来越关注现实世界研究(RWS)的潜在好处和相关性,这些研究与数据挖掘算法以及大规模累积的现实世界数据集的快速发展有关。然而,由于需要处理大量数据以及解决问题的难度,特别是非凸优化问题,解决了这种大规模的RWS问题超出了传统数据挖掘算法的能力。在许多情况下,对于这些非凸问题,目标是找到一个合理的局部最小值,而最主要的问题是迭代更新很容易陷入鞍点或不良的局部最优点中。在本文中,我们旨在使平滑非线性函数和块分离非凸函数的总和最小,该函数涉及受不等式约束的大量块变量,涵盖了分布优化,统计学习,然而,解决与不平等约束相关的非凸非线性优化问题仍然是一项艰巨的任务。这是因为:1)如何使用块坐标下降法从鞍点和不良的局部最优点逃脱,以处理非凸非线性优化函数仍然是一个悬而未决的问题。 2)当问题规模很大时,计算整个块变量可能会非常昂贵。因此,我们提出了一种新颖的并行一阶优化方法,称为带时间扰动(ATP)的异步块坐标下降,该方法利用了从鞍点和局部最优点逃逸的时间扰动技术。我们提出了该方法的随机块变量选择规则,以及迭代的复杂性。针对RWS中典型的机器学习问题(即折叠凹面线性回归)进行的实验验证了我们优化方法的有效性。特别是,实验结果表明,时间扰动可以极大地帮助所提出的算法摆脱非凸部分的鞍点和局部最优解,这为利用块坐标下降法解决非凸优化问题提供了一种有希望的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号