首页> 外文期刊>Computers & Industrial Engineering >Minimizing total job completion time in MapReduce scheduling
【24h】

Minimizing total job completion time in MapReduce scheduling

机译:最小化MapReduce Scheduling中的总作业完成时间

获取原文
获取原文并翻译 | 示例
           

摘要

We follow up an earlier studied multiple-task parallel-machine scheduling model that captures the core challenges in MapReduce scheduling, with the optimization goal to minimize the total job completion time. The problem is a novel generalization of the classic two-stage flow-shop scheduling, in which we are given a set of jobs each is associated with multiple map tasks and multiple reduce tasks. All these tasks are non-preemptive to be processed on multiple parallel identical map machines and multiple parallel identical reduce machines, respectively, under the strict precedence constraints that, for each job, any reduce task cannot start before all its map tasks have been finished. We prove a new lower bound on the total job completion time, and based on which we present an O(nlogn + N + m_1 + m_2)-time (9 -3/m_1)-approximation algorithm, where n and N are the number of jobs and the total number of tasks, respectively, mi and m.2 are the numbers of map and reduce machines, respectively, and they can even be part of the input. Our algorithm improves the previous best 12-approximation, and it reduces to the best approximation algorithms for several interesting or well studied special cases. We confirm through numerical experiments on 828,100 instances that both our lower bound and our algorithm significantly outperform: the empirical mean improvement ratio of the new lower bound is as high as 37.75%, and the empirical mean approximation ratio of our algorithm is only 1.7582.
机译:我们跟进早期研究的多项任务并行机器调度模型,捕获MapReduce调度中的核心挑战,优化目标可以最大限度地减少总作业完成时间。问题是经典的两阶段流程店调度的新推广,其中我们被给予一组作业,每个作业与多个映射任务和多个减少任务相关联。所有这些任务都是在多个并行相同的地图机上处理的非抢先性,并且分别在严格的优先约束下,对于每个作业,任何减少任务都无法启动,在完成所有映射任务之前,都无法启动。我们在总工作完成时间上证明了一个新的下限,并基于我们呈现O(nlogn + n + m_1 + m_2) - time(9 -3 / m_1) - 达到估计算法,其中n和n是数字作业和任务总数,分别是MI和M.2的数量分别是地图和减少机器的数量,并且它们甚至可以成为输入的一部分。我们的算法改善了以前的最佳12近似值,并且它减少了几个有趣或良好的特殊情况的最佳近似算法。我们通过828,100实例通过数值实验确认,我们的下限和我们的算法都显着优于胜过:新下限的经验性平均提高比率高达37.75%,而我们算法的实证平均近似比仅为1.7582。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号