首页> 外文会议>IEEE International Congress on Big Data >Distributed frank-wolfe under pipelined stale synchronous parallelism
【24h】

Distributed frank-wolfe under pipelined stale synchronous parallelism

机译:分布在流水线上的坦率沃尔夫在流水线上同步并行

获取原文

摘要

Iterative-convergent algorithms represent an important family of applications in big data analytics. These are typically run on distributed processing frameworks deployed on a cluster of machines. On the other hand, we are witnessing the move towards data center operating systems (OS), where resources are unified by a resource manager and processing frameworks coexist with each other. In this context, different processing framework job tasks can be scheduled on the same machine and slow down a worker (straggler problem). Existing work has shown that an iteration model with relaxed consistency such as the Stale Synchronous Parallel (SSP) model, while still guaranteeing convergence, is able to cope with stragglers. In this paper we propose a model for the integration of the SSP model on a pipelined distributed processing framework. We then apply SSP on a distributed version of the FrankWolfe algorithm. We theoretically show its sparsity bounds and convergence under SSP. Finally, we experimentally show that the Frank-Wolfe algorithm applied on LASSO regression under SSP is able to converge faster than its BSP counterpart, especially under load conditions similar to those encountered in a data center OS.
机译:迭代收敛算法代表的大数据分析应用的一个重要的家庭。这些通常是在部署计算机集群上的分布式处理框架运行。在另一方面,我们正在目睹向数据中心操作系统(OS),其中将资源由资源管理和统一处理框架彼此共存的举动。在这种情况下,不同的处理框架的工作任务可以安排在同一台机器上,减缓工人(掉队的问题)。现有的工作已经表明,与诸如陈旧同步并行(SSP)模型松弛一致性迭代模型,同时仍然保证收敛性,是能够应付离散星。在本文中,我们提出了一个流水线分布式处理框架的SSP模型的集成模型。然后,我们在FrankWolfe算法的分布式版本适用SSP。从理论上证明SSP在其稀疏边界和衔接。最后,我们通过实验表明,弗兰克 - 沃尔夫算法应用于LASSO回归下SSP能够收敛速度比其对应的BSP,特别是在类似的数据中心操作系统遇到负载条件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号