
A Taylor polynomial expansion line search for large-scale optimization


Abstract

In trying to cope with the Big Data deluge, the landscape of distributed computing has changed. Large commodity hardware clusters, typically operating in some form of MapReduce framework, are becoming prevalent for organizations that require both tremendous storage capacity and fault tolerance. However, the high cost of communication can dominate the computation time in large-scale optimization routines in these frameworks. This thesis considers the problem of how to efficiently conduct univariate line searches in commodity clusters in the context of gradient-based batch optimization algorithms, like the staple limited-memory BFGS (LBFGS) method. In it, a new line search technique is proposed for cases where the underlying objective function is analytic, as in logistic regression and low-rank matrix factorization. The technique approximates the objective function by a truncated Taylor polynomial along a fixed search direction. The coefficients of this polynomial may be computed efficiently in parallel with far less communication than is needed to transmit the high-dimensional gradient vector, after which the polynomial may be minimized with high accuracy in a neighbourhood of the expansion point without distributed operations. This Polynomial Expansion Line Search (PELS) may be invoked iteratively until the expansion point and minimum are sufficiently accurate, and can provide substantial savings in time and communication costs when multiple iterations of the line search procedure are required.

Three applications of the PELS technique are presented herein for important classes of analytic functions: (i) logistic regression (LR), (ii) low-rank matrix factorization (MF) models, and (iii) the feedforward multilayer perceptron (MLP). In addition, for LR and MF, implementations of PELS in the Apache Spark framework for fault-tolerant cluster computing are provided.
These implementations conferred significant convergence enhancements to their respective algorithms, and will be of interest to Spark and Hadoop practitioners. For instance, the Spark PELS technique reduced the number of iterations and time required by LBFGS to reach terminal training accuracies for LR models by factors of 1.8--2. Substantial acceleration was also observed for the Nonlinear Conjugate Gradient algorithm for MLP models, which is an interesting case for future study in optimization for neural networks. The PELS technique is applicable to a broad class of models for Big Data processing and large-scale optimization, and can be a useful component of batch optimization routines.
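The core idea of the abstract — approximating the one-dimensional restriction φ(α) = f(x + αd) by a truncated Taylor polynomial and then minimizing that polynomial cheaply — can be illustrated with a minimal single-machine sketch. This is not the thesis's distributed Spark implementation: the toy objective, truncation order, and function names here are chosen purely for the example.

```python
import math
import numpy as np

def pels_step(phi_coeffs, alpha_max):
    """Minimize the truncated Taylor polynomial phi(alpha) = sum_k c_k alpha^k
    over [0, alpha_max] by checking its real stationary points and endpoints."""
    p = np.polynomial.Polynomial(phi_coeffs)
    stationary = p.deriv().roots()
    cands = [r.real for r in stationary
             if abs(r.imag) < 1e-12 and 0.0 <= r.real <= alpha_max]
    cands += [0.0, alpha_max]
    return min(cands, key=p)          # candidate with the smallest phi value

# Toy analytic objective f(x) = sum_i exp(x_i): along a direction d, the
# restriction phi(alpha) = f(x + alpha*d) has exact Taylor coefficients
# c_k = sum_i exp(x_i) * d_i**k / k!  about alpha = 0.  In a distributed
# setting, only these K+1 scalar sums would need to be aggregated across
# workers, rather than a full high-dimensional gradient vector.
x = np.array([1.0, -0.5, 0.3])
d = -np.exp(x)                        # steepest-descent direction for this f
K = 6                                 # truncation order (a free parameter)
c = [np.sum(np.exp(x) * d**k) / math.factorial(k) for k in range(K + 1)]

alpha = pels_step(c, alpha_max=2.0)   # approximate minimizer along d
```

The communication saving comes from the shape of the reduction: each worker contributes K + 1 scalars (its partial sums for the coefficients) instead of a dense gradient, and the polynomial minimization itself is a purely local operation.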

Bibliographic record

  • Author

    Hynes, Michael

  • Author affiliation
  • Year 2016
  • Total pages
  • Original format PDF
  • Language en
  • CLC classification
