首页> 外文会议>International Conference on Distributed Computing and Internet Technology >Load Balancing Approach for a MapReduce Job Running on a Heterogeneous Hadoop Cluster
【24h】

Load Balancing Approach for a MapReduce Job Running on a Heterogeneous Hadoop Cluster

机译:在异构Hadoop集群上运行MapReduce作业的负载平衡方法

获取原文

摘要

Hadoop MapReduce has become the de-facto standard in today's Big data world to process the more prominent data sets on a distributed cluster of commodity hardware. Today computing nodes in a commodity cluster do not have the same hardware configuration, which leads to heterogeneity. Heterogeneity has become common in the industry, research institutes, and academics. Our study shows that the current rules for calculating the required number of Reduce tasks (Reducers) for a MapReduce job are fallacious, leading to significant computing resources' overutilization. It also degrades MapReduce job performance running on a heterogeneous Hadoop cluster. However, there is no definite answer to the question: What is the optimal number of Reduce tasks required for a MapReduce job to get Hadoop's most accomplished performance in a heterogeneous cluster? We have proposed a new rule that decides the required number of reduce tasks for a MapReduce job running on a heterogeneous Hadoop cluster accurately. The proposed rule balances the load among the heterogeneous nodes in the Reduce phase of MapReduce. It also minimizes computing resources' overutilization and improves the MapReduce job execution time by an average of 18% and 28% for TeraSort and PageRank applications running on a heterogeneous Hadoop cluster.
机译:Hadoop MapReduce已成为当今大数据世界的De-Facto标准,以处理在分布式商品硬件集群上更加突出的数据集。今天,商品集群中的计算节点没有相同的硬件配置,这导致异质性。异质性在行业,研究机构和学术界中变得普遍。我们的研究表明,计算用于MapReduce作业所需的减少任务数(Reducers)的当前规则是谬谬的,导致显着的计算资源过抵制。它还降低了在异构Hadoop集群上运行的MapReduce作业性能。但是,问题没有明确的答案:MapReduce作业所需的最佳衰减任务是什么是在异构群集中获得Hadoop最成就的性能的最佳衰减任务?我们提出了一种新规则,可以准确地决定MapReduce作业的所需任务数量。所提出的规则平衡了MapReduce的减少阶段的异构节点之间的负载。它还最大限度地减少了计算资源的冻融化,并将MapReduce作业执行时间提高了在异构Hadoop集群上运行的Terasort和PageRank应用程序的18%和28%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号