首页> 外文会议>International Conference on Computational Intelligence and Communication Networks >Minimizing Skew in MapReduce Applications Using Node Clustering in Heterogeneous Environment
【24h】

Minimizing Skew in MapReduce Applications Using Node Clustering in Heterogeneous Environment

机译:使用异构环境中的节点群集将MapReduce应用程序中的偏斜最小化

获取原文

摘要

We present an automatic skew minimization approach defined for MapReduce programs and present proposed system that implements this approach as a replacement for an existing MapReduce implementation. The proposed system addresses these challenges and works as follows: From intermediate output from map tasks data skew present in records to solve this problem we create two sets of node mainly of high and low computational power nodes and assign the skewed records to the high computational power nodes and remaining to the low computational power nodes to process further reduce task. We implement proposed system as an extension to Hadoop and evaluate its effectiveness using real applications. The results show that proposed system can reduce job runtime in the presence of skew and adds little to no overhead in the absence of skew in heterogeneous environment.
机译:我们提出了一种针对MapReduce程序定义的自动偏斜最小化方法,并提出了将这种方法实现为现有MapReduce实施方案的替代方案的拟议系统。拟议的系统解决了这些挑战,其工作方式如下:从记录任务中出现的地图任务数据偏斜的中间输出来解决此问题,我们创建了两个主要由高和低计算能力节点组成的节点集,并将偏斜的记录分配给了高计算能力节点并保留给计算能力较低的节点以进一步处理任务。我们将提议的系统实现为Hadoop的扩展,并使用实际应用程序评估其有效性。结果表明,所提出的系统可以在存在偏斜的情况下减少作业运行时间,并且在异构环境中在没有偏斜的情况下几乎没有增加开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号