...
首页> 外文期刊>Journal of Parallel and Distributed Computing >Joint scheduling of MapReduce jobs with servers: Performance bounds and experiments
【24h】

Joint scheduling of MapReduce jobs with servers: Performance bounds and experiments

机译:与服务器联合调度MapReduce作业:性能界限和实验

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

MapReduce-like frameworks have achieved tremendous success for large-scale data processing in data centers. A key feature distinguishing MapReduce from previous parallel models is that it interleaves parallel and sequential computation. Past schemes, and especially their theoretical bounds, on general parallel models are therefore, unlikely to be applied to MapReduce directly. There are many recent studies on MapReduce job and task scheduling. These studies assume that the servers are assigned in advance. In current data centers, multiple MapReduce jobs of different importance levels run together. In this paper, we investigate a schedule problem for MapReduce taking server assignment into consideration as well. We formulate a MapReduce server-job organizer problem (MSJO) and show that it is NP-complete. We develop a 3-approximation algorithm and a fast heuristic design. Moreover, we further propose a novel fine-grained practical algorithm for general MapReduce-like task scheduling problem. Finally, we evaluate our algorithms through both simulations and experiments on Amazon EC2 with an implementation with Hadoop. The results confirm the superiority of our algorithms.
机译:类似于MapReduce的框架在数据中心的大规模数据处理方面已经取得了巨大的成功。 MapReduce与以前的并行模型区别的一个关键功能是它交错并行和顺序计算。因此,一般并行模型上的过去方案,尤其是它们的理论界限,不太可能直接应用于MapReduce。最近有许多关于MapReduce作业和任务调度的研究。这些研究假定服务器是预先分配的。在当前的数据中心中,具有不同重要性级别的多个MapReduce作业可以同时运行。在本文中,我们还考虑了服务器分配,研究了MapReduce的调度问题。我们制定了一个MapReduce服务器-作业组织者问题(MSJO),并证明它是NP完全的。我们开发了一种3近似算法和一种快速的启发式设计。此外,我们还针对类MapReduce的一般任务调度问题提出了一种新颖的细粒度实用算法。最后,我们通过在具有Hadoop实施的Amazon EC2上进行仿真和实验来评估算法。结果证实了我们算法的优越性。

著录项

  • 来源
  • 作者单位

    Tsinghua National Laboratory for Information Science and Technology, Institute for Network Sciences and Cyberspace, Tsinghua University, Beijing, China;

    Department of Computing Science, The Hong Kong Polytechnic University, Hong Kong;

    Department of Computing Science, The Hong Kong Polytechnic University, Hong Kong;

    Department of Computing Science, Simon Fraser University, British Columbia, Canada;

    Tsinghua National Laboratory for Information Science and Technology, Institute for Network Sciences and Cyberspace, Tsinghua University, Beijing, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    MapReduce; Scheduling; Server assignment; NP-complete; Fast heuristic;

    机译:MapReduce;排程;服务器分配;NP完全;快速启发式;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号