首页> 外文会议>International Conference on Parallel Processing Workshops >Software-Defined Data Shuffling for Big Data Jobs with Task Duplication
【24h】

Software-Defined Data Shuffling for Big Data Jobs with Task Duplication

机译:具有任务复制功能的大数据作业的软件定义数据改组

获取原文

摘要

Big data jobs are usually executed on large-scale distributed computing platforms that automatically divide a job into multiple computation phases, each of which contains a number of independent tasks that can run in parallel. The data shuffling process between two consecutive phases becomes the bottleneck of job execution. To improve its performance, an approach of "push" shuffling is proposed to send intermediate results to next phase immediately once they are generated. It avoids local disk accesses in the traditional "pull" shuffling approach, and tasks in the next phase can start data processing without waiting tasks in the predecessive phase to finish. Task duplication is another approach to accelerate task execution by launching multiple task copies that compete for processing the same data block. When "push" shuffling meets task duplication, big data jobs can be significantly accelerated, but leading to a large amount of redundant data transmission between two phases. To address this challenge, we propose a software-define data shuffling approach by designing a controller and a janitor module to control the data shuffling process. Each task has a janitor that communicates with the controller to request admission permit of sending intermediate results to next-stage tasks. We further propose an online grouping algorithm to reduce the overhead of frequent communication with the controller. The performance of the proposed algorithm is evaluated by extensive simulations.
机译:大数据作业通常在大型分布式计算平台上执行,该平台将作业自动划分为多个计算阶段,每个阶段包含许多可以并行运行的独立任务。两个连续阶段之间的数据改组过程成为作业执行的瓶颈。为了提高其性能,提出了一种“推”改组的方法,以便在生成中间结果后立即将其发送到下一个阶段。它避免了传统的“拉”式改组方法中的本地磁盘访问,并且下一阶段的任务可以开始数据处理,而无需等待先前阶段的任务完成。任务复制是通过启动多个竞争处理同一数据块的任务副本来加速任务执行的另一种方法。当“推送”改组遇到任务重复时,可以大大加速大数据作业,但会导致两个阶段之间大量冗余数据传输。为了解决这一挑战,我们提出了一种软件定义的数据改组方法,方法是设计控制器和管理员模块来控制数据改组过程。每个任务都有一个与控制器通信的看门人,以请求允许将中间结果发送给下一阶段任务的准入许可。我们进一步提出了一种在线分组算法,以减少与控制器频繁通信的开销。通过广泛的仿真评估了所提出算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号