Parallel Computing

Asynchronous parallel stochastic Quasi-Newton methods

Abstract

Although first-order stochastic algorithms, such as stochastic gradient descent, have been the main force in scaling up machine learning models such as deep neural networks, second-order quasi-Newton methods have started to draw attention due to their effectiveness on ill-conditioned optimization problems. The L-BFGS method is one of the most widely used quasi-Newton methods. We propose an asynchronous parallel algorithm for the stochastic quasi-Newton (AsySQN) method. Unlike prior attempts, which parallelize only the gradient calculation or the two-loop recursion of L-BFGS, our algorithm is the first to truly parallelize L-BFGS with a convergence guarantee. By adopting a variance-reduction technique, a prior stochastic L-BFGS method, which was not designed for parallel computing, achieves a linear convergence rate. We prove that our asynchronous parallel scheme maintains the same linear convergence rate while achieving significant speedup. Empirical evaluations on both simulations and benchmark datasets demonstrate the speedup over the non-parallel stochastic L-BFGS, as well as better performance than first-order methods on ill-conditioned problems.
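The two building blocks named in the abstract are standard: the two-loop recursion applies the L-BFGS inverse-Hessian approximation to a gradient without forming any matrix, and variance reduction is what gives the prior stochastic L-BFGS its linear rate. The sketch below is a generic textbook rendering of both in Python, under assumed names (`lbfgs_two_loop`, `svrg_gradient`, `grad_fn`); it is not the paper's AsySQN implementation.

```python
import numpy as np

def lbfgs_two_loop(grad, s_list, y_list):
    """Apply the L-BFGS inverse-Hessian approximation H to `grad`
    using the stored curvature pairs (s_i, y_i), matrix-free.
    Standard two-loop recursion; illustrative only."""
    if not s_list:                     # no curvature history yet
        return grad.copy()
    q = grad.astype(float).copy()
    rhos = [1.0 / np.dot(y, s) for s, y in zip(s_list, y_list)]
    alphas = []
    # First loop: walk the history from the newest pair to the oldest.
    for s, y, rho in zip(reversed(s_list), reversed(y_list), reversed(rhos)):
        alpha = rho * np.dot(s, q)
        alphas.append(alpha)
        q -= alpha * y
    # Scale by the initial Hessian guess gamma*I, gamma = s'y / y'y.
    s, y = s_list[-1], y_list[-1]
    r = (np.dot(s, y) / np.dot(y, y)) * q
    # Second loop: walk back from the oldest pair to the newest.
    for (s, y, rho), alpha in zip(zip(s_list, y_list, rhos), reversed(alphas)):
        beta = rho * np.dot(y, r)
        r += (alpha - beta) * s
    return r                           # the descent direction is -r

def svrg_gradient(grad_fn, w, w_snapshot, full_grad_snapshot, batch):
    """SVRG-style variance-reduced stochastic gradient: an unbiased
    estimate whose variance shrinks as w approaches the optimum,
    which is what yields the linear convergence rate."""
    return grad_fn(w, batch) - grad_fn(w_snapshot, batch) + full_grad_snapshot
```

In an asynchronous parallel setting, each worker would typically evaluate `svrg_gradient` on a possibly stale iterate and feed the result through `lbfgs_two_loop`; the abstract's claim is that fully parallelizing this pipeline preserves the linear rate while delivering significant speedup.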