Distributed frank-wolfe under pipelined stale synchronous parallelism

机译：流水线过时的同步并行性下的分布式坦率狼

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Iterative-convergent algorithms represent an important family of applications in big data analytics. These are typically run on distributed processing frameworks deployed on a cluster of machines. On the other hand, we are witnessing the move towards data center operating systems (OS), where resources are unified by a resource manager and processing frameworks coexist with each other. In this context, different processing framework job tasks can be scheduled on the same machine and slow down a worker (straggler problem). Existing work has shown that an iteration model with relaxed consistency such as the Stale Synchronous Parallel (SSP) model, while still guaranteeing convergence, is able to cope with stragglers. In this paper we propose a model for the integration of the SSP model on a pipelined distributed processing framework. We then apply SSP on a distributed version of the FrankWolfe algorithm. We theoretically show its sparsity bounds and convergence under SSP. Finally, we experimentally show that the Frank-Wolfe algorithm applied on LASSO regression under SSP is able to converge faster than its BSP counterpart, especially under load conditions similar to those encountered in a data center OS.

机译：迭代收敛算法代表了大数据分析中的重要应用系列。这些通常在部署在机器集群上的分布式处理框架上运行。另一方面，我们正在目睹向数据中心操作系统（OS）的转变，在该系统中，资源由资源管理器统一，并且处理框架彼此并存。在这种情况下，可以在同一台计算机上安排不同的处理框架作业任务，并降低工作人员的速度（混乱的问题）。现有工作表明，具有宽松一致性的迭代模型（例如Stale同步并行（SSP）模型）在仍保证收敛的同时，能够应对散乱的问题。在本文中，我们提出了一个用于在流水线分布式处理框架上集成SSP模型的模型。然后，我们在FrankWolfe算法的分布式版本上应用SSP。我们从理论上说明了SSP下其稀疏性边界和收敛性。最后，我们通过实验证明，在SSP下应用于LASSO回归的Frank-Wolfe算法比BSP同类算法能够收敛更快，尤其是在类似于数据中心OS所遇到的负载条件下。

著录项

来源
《IEEE International Congress on Big Data》|2015年|184-192|共9页
会议地点
作者
Tran Nam-Luc; Peel Thomas; Skhiri Sabri;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Frank-Wolfe; LASSO regression; big data; distributed convex optimization; parameter server; stale synchronous parallel;

机译：Frank-Wolfe; LASSO回归;大数据;分布式凸优化;参数服务器;陈旧的同步并行;

相似文献

外文文献
中文文献
专利

1. A Case for Stale Synchronous Distributed Model for Declarative Recursive Computation [J] . Das Ariyam, Zaniolo Carlo Theory and Practice of Logic Programming . 2019,第5a6期

机译：声明式递归计算的陈旧同步分布式模型的案例
2. Adaptive structured parallelism for distributed heterogeneous architectures: a methodological approach with pipelines and farms [J] . Horacio Gonzalez-Velez, Murray Cole Concurrency, practice and experience . 2010,第15期

机译：分布式异构体系结构的自适应结构化并行性：使用管道和服务器场的方法学方法
3. Randomised block-coordinate Frank-Wolfe algorithm for distributed online learning over networks [J] . Jingchao Li, Qingtao Wu, Ruijuan Zheng, Cognitive Computation and Systems . 2020,第2期

机译：随机块 - 坐标弗兰克 - 沃尔夫在网络上分布在线学习的算法
4. Distributed frank-wolfe under pipelined stale synchronous parallelism [C] . Tran Nam-Luc, Peel Thomas, Skhiri Sabri IEEE International Congress on Big Data . 2015

机译：分布在流水线上的坦率沃尔夫在流水线上同步并行
5. Distributing Frank-Wolfe via Map-Reduce [D] . Moharrer, Armin. 2018

机译：通过Map-Reduce分发Frank-Wolfe
6. More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server [O] . Qirong Ho, James Cipar, Henggang Cui, -1

机译：通过一个陈旧的同步并行参数服务器更有效的分布式mL
7. TAPP: DNN Training for Task Allocation through Pipeline Parallelism Based on Distributed Deep Reinforcement Learning [O] . Yingchi Mao, Zijian Tu, Fagang Xi, 2021

机译：TAPP：通过基于分布式深度增强学习的管道并行性任务分配DNN培训

Distributed frank-wolfe under pipelined stale synchronous parallelism

摘要

著录项

相似文献

相关主题

期刊订阅