In this paper, researching on task scheduling is a way from the perspective of resource allocation and management to improve performance of Hadoop system. I'/> A strategy for scheduling reduce task based on intermediate data locality of the MapReduce
首页> 外文期刊>Cluster computing >A strategy for scheduling reduce task based on intermediate data locality of the MapReduce
【24h】

A strategy for scheduling reduce task based on intermediate data locality of the MapReduce

机译:基于MapReduce中间数据征点的调度减少任务的策略

获取原文
获取原文并翻译 | 示例
           

摘要

AbstractIn this paper, researching on task scheduling is a way from the perspective of resource allocation and management to improve performance of Hadoop system. In order to save the network bandwidth resources in Hadoop cluster environment and improve the performance of Hadoop system, a ReduceTask scheduling strategy that based on data-locality is improved. In MapReduce stage, there are two main data streams in cluster network, they are slow task migration and remote copies of data. The two overlapping burst data transfer can easily become bottlenecks of the cluster network. To reduce the amount of remote copies of data, combining with data-locality, we establish a minimum network resource consumption model (MNRC). MNRC is used to calculate the network resources consumption of ReduceTask. Based on this model, we design a delay priority scheduling policy for the ReduceTask which is based on the cost of network resource consumption. Finally, MNRC is verified by simulation experiments. Evaluation results show that MNRC outperforms the saving cluster network resource by an average of 7.5% in heterogeneous.
机译:<标题>抽象 ara id =“par4”>在本文中,研究任务调度是一种从资源分配和管理的角度来提高Hadoop系统性能的一种方式。为了保存Hadoop集群环境中的网络带宽资源并提高Hadoop系统的性能,提高了基于数据局部性的冗余调度策略。在MapReduce阶段,群集网络中有两个主要数据流,它们是慢的任务迁移和远程数据副本。两个重叠的突发数据传输可以很容易地成为群集网络的瓶颈。为了减少数据的远程副本量,与数据局部结合,我们建立了最小网络资源消耗模型(MNRC)。 MNRC用于计算冗余的网络资源消耗。基于此模型,我们设计了用于冗余的延迟优先级调度策略,其基于网络资源消耗的成本。最后,通过模拟实验验证MNRC。评估结果表明,MNRC在异构的平均值为7.5%的情况下优于节省群集网络资源。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号