首页> 外文会议>IEEE International Conference on Smart Cloud >Similarity-Based Node Distance Exploring and Locality-Aware Shuffle Optimization for Hadoop MapReduce
【24h】

Similarity-Based Node Distance Exploring and Locality-Aware Shuffle Optimization for Hadoop MapReduce

机译:Hadoop MapReduce的基于相似度的节点距离探索和基于位置的随机优化

获取原文

摘要

To shorten the networking delay from MapTracker to ReduceTracker has attractive potential to gain high performance shuffle for MapReduce. As the original MapReduce shuffle has no locality-aware feature when assigning reduce-tasks over computing nodes, we plan to present a similarity-based distance in the proposed Cloud Node Space to evaluate distance between two computing nodes in data center. Then, we implement a centralized and statistic-based locating service prior networking shuffle to place reduce-tasks near their corresponding data. Experimental results show that, comparing with Hadoop version, this service can achieve 2.3X speedup on shuffle time and bandwidth budget decreases by 60%.
机译:为了缩短从MapTracker到ReduceTracker的网络延迟,具有吸引人的潜力来获得MapReduce的高​​性能改组。由于当在计算节点上分配归约任务时,原始MapReduce随机播放不具有位置感知功能,因此我们计划在拟议的云节点空间中提供基于相似度的距离,以评估数据中心中两个计算节点之间的距离。然后,我们在进行网络改组之前实现了一个集中的,基于统计信息的定位服务,以将约简任务放置在它们对应的数据附近。实验结果表明,与Hadoop版本相比,该服务可将洗牌时间提高2.3倍,带宽预算减少60%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号