首页> 外文会议>IEEE International Congress on Big Data >Scheduling MapReduce tasks on virtual MapReduce clusters from a tenant's perspective
【24h】

Scheduling MapReduce tasks on virtual MapReduce clusters from a tenant's perspective

机译:从租户的角度安排虚拟MapReduce集群上的MapReduce任务调度

获取原文

摘要

Renting a set of virtual private servers (VPSs for short) from a VPS provider to establish a virtual MapReduce cluster is cost-efficient for a company/organization. To shorten job turnaround time and keep data locality as high as possible in this type of environment, this paper proposes a Best-Fit Task Scheduling scheme (BFTS for short) from a tenant's perspective. BFTS schedules each map task to a VPS that can finish the task earlier than the other VPSs by predicting and comparing the time required by every VPS to retrieve the map-input data, execute the map task, and become idle in an online manner. Furthermore, BFTS schedules each reduce task to a VPS that is close to most VPSs that execute the related map tasks. We conduct extensive experiments to compare BFTS with several scheduling algorithms employed by Hadoop. The experimental results show that BFTS is better than the other tested algorithms in terms of map-data locality, reduce-data locality, and job turnaround time. The overhead incurred by BFTS is also evaluated, which is inevitable but acceptable compared with the other algorithms.
机译:从VPS提供商租用一组虚拟专用服务器(简称VPS)来建立虚拟MapReduce集群对于公司/组织而言具有成本效益。为了在这种类型的环境中缩短作业周转时间并保持尽可能高的数据局部性,本文从租户的角度提出了最佳适合的任务计划方案(简称BFTS)。 BFTS通过预测和比较每个VPS检索地图输入数据,执行地图任务以及以在线方式变为空闲状态所需的时间,将每个地图任务调度到一个VPS,使其可以比其他VPS提前完成任务。此外,BFTS将每个化简任务调度到与执行相关映射任务的大多数VPS接近的VPS。我们进行了广泛的实验,以将BFTS与Hadoop所采用的几种调度算法进行比较。实验结果表明,BFTS在地图数据局部性,减少数据局部性和作业周转时间方面优于其他测试算法。还评估了BFTS产生的开销,这是不可避免的,但与其他算法相比是可以接受的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号