首页> 外文会议>IEEE International Conference on Smart Cloud >Similarity-Based Node Distance Exploring and Locality-Aware Shuffle Optimization for Hadoop MapReduce
【24h】

Similarity-Based Node Distance Exploring and Locality-Aware Shuffle Optimization for Hadoop MapReduce

机译:基于相似的节点距离探索和占地面积浏览Hadoop MapReduce的Shuffle优化

获取原文

摘要

To shorten the networking delay from MapTracker to ReduceTracker has attractive potential to gain high performance shuffle for MapReduce. As the original MapReduce shuffle has no locality-aware feature when assigning reduce-tasks over computing nodes, we plan to present a similarity-based distance in the proposed Cloud Node Space to evaluate distance between two computing nodes in data center. Then, we implement a centralized and statistic-based locating service prior networking shuffle to place reduce-tasks near their corresponding data. Experimental results show that, comparing with Hadoop version, this service can achieve 2.3X speedup on shuffle time and bandwidth budget decreases by 60%.
机译:要缩短MapTracker的网络延迟,以ReduceCtacker具有吸引力的潜力,以获得MapReduce的高​​性能随机播放。由于原始MapReduce Shuffle在计算节点上分配缩小任务时没有位置感知功能,我们计划在所提出的云节点空间中呈现基于相似性的距离,以评估数据中心的两个计算节点之间的距离。然后,我们实现了基于集中的基于统计的定位服务,先前的网络洗机,以将减少任务放置在其相应的数据附近。实验结果表明,与Hadoop版本相比,这项服务可以实现2.3倍的加速时间和带宽预算减少60 %。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号