首页> 外文会议>2011 International Conference on Cloud and Service Computing >An effective data locality aware task scheduling method for MapReduce framework in heterogeneous environments
【24h】

An effective data locality aware task scheduling method for MapReduce framework in heterogeneous environments

机译:一种异构环境中MapReduce框架的有效的数据位置感知任务调度方法

获取原文
获取原文并翻译 | 示例

摘要

Data locality has recently been extensively exploited in Cloud computing to improve system performance. However, when schedule Map tasks in Hadoop MapReduce framework working in a heterogeneous environment, existing methods either cannot reduce the occurrence of these Map tasks or injure fairness, thus degrading the system performance. In order to address this problem, this paper proposes a data locality aware scheduling method to improve the Hadoop MapReduce system performance in heterogeneous computing environments. After receiving a request from a requesting node, our method preferentially schedules the task whose input data is stored on the requesting node. If no such tasks exist, our method will select the task whose input data is nearest to the requesting node, and then make a decision on whether to reserve the task for the node storing the input data or schedule the task to the requesting node by transferring the input data to the requesting node on the fly. As a proof of concept, we implement the method in Hadoop-0.20.2. In order to evaluate the performance, we carry out an experimental comparison study on our proposed method against the default scheduling method used in Hadoop-0.20.2. The experiment results show that our proposed method improves the data locality and reduces the normalized execution time as well as the response time of jobs.
机译:最近,数据位置已在云计算中得到广泛利用,以提高系统性能。但是,当在异构环境中的Hadoop MapReduce框架中调度Map任务时,现有方法无法减少这些Map任务的发生或损害公平性,从而降低系统性能。为了解决这个问题,本文提出了一种数据局部性感知调度方法,以提高异构计算环境中Hadoop MapReduce系统的性能。在接收到来自请求节点的请求之后,我们的方法优先调度其输入数据存储在请求节点上的任务。如果不存在这样的任务,则我们的方法将选择输入数据最接近请求节点的任务,然后决定是将任务保留给存储输入数据的节点,还是将任务调度到请求节点输入数据到请求节点的过程中。作为概念证明,我们在Hadoop-0.20.2中实现该方法。为了评估性能,我们对我们提出的方法与Hadoop-0.20.2中使用的默认调度方法进行了实验比较研究。实验结果表明,本文提出的方法提高了数据的局部性,减少了标准化的执行时间和作业的响应时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号