Similarity-Based Node Distance Exploring and Locality-Aware Shuffle Optimization for Hadoop MapReduce

Abstract

Shortening the networking delay from MapTracker to ReduceTracker has attractive potential for achieving a high-performance shuffle in MapReduce. Because the original MapReduce shuffle has no locality-aware feature when assigning reduce tasks to computing nodes, we present a similarity-based distance in the proposed Cloud Node Space to evaluate the distance between two computing nodes in a data center. We then implement a centralized, statistics-based locating service that runs prior to the network shuffle and places reduce tasks near their corresponding data. Experimental results show that, compared with the Hadoop version, this service achieves a 2.3X speedup in shuffle time and reduces the bandwidth budget by 60%.
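
The abstract does not give the formula for the similarity-based distance or the internals of the locating service, so the following Python sketch is only an illustration under stated assumptions. It assumes each node is described by a hypothetical feature vector, uses one minus cosine similarity as a stand-in distance, and places a reduce task on the node with the lowest byte-weighted distance to the map outputs of its partition. The names `similarity_distance`, `place_reduce_task`, `features`, and `partition_bytes` are all hypothetical and do not come from the paper or from Hadoop.

```python
import math
from typing import Dict, List


def similarity_distance(a: List[float], b: List[float]) -> float:
    """Assumed stand-in distance: 1 - cosine similarity of two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    # A zero vector has no direction, so treat it as maximally distant.
    return 1.0 - dot / norm if norm else 1.0


def place_reduce_task(
    partition_bytes: Dict[str, int],
    features: Dict[str, List[float]],
) -> str:
    """Pick the node with the lowest byte-weighted distance to the map outputs."""

    def cost(candidate: str) -> float:
        return sum(
            size * similarity_distance(features[candidate], features[source])
            for source, size in partition_bytes.items()
        )

    # Sorting the node names makes tie-breaking deterministic.
    return min(sorted(features), key=cost)


# Hypothetical three-node cluster described by made-up feature vectors.
features = {"n1": [1.0, 0.0], "n2": [0.6, 0.8], "n3": [0.0, 1.0]}
# Bytes of one reduce partition produced by the map tasks on each node.
partition_bytes = {"n1": 800, "n2": 150, "n3": 50}
chosen = place_reduce_task(partition_bytes, features)
```

In this made-up example most of the partition's data sits on `n1`, so the weighted cost is lowest there and the reduce task is placed on `n1`. A real locating service would gather the per-node statistics centrally before the shuffle starts, as the abstract describes.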

Bibliographic record

Source
IEEE International Conference on Smart Cloud | 2017 | pp. 103-108 | 6 pages
Authors
Jihe Wang; Danghui Wang; Meng Zhang; Meikang Qiu; Bing Guo
Original format
PDF
Keywords
Bandwidth; Mathematical model; Resistance; Topology; Optical switches; Cloud computing

Similar literature

1. Phase-Reconfigurable Shuffle Optimization for Hadoop MapReduce [J]. Jihe Wang, Meikang Qiu, Bing Guo. IEEE Transactions on Cloud Computing. 2020, No. 2
2. SHadoop: Improving MapReduce performance by optimizing job execution mechanism in Hadoop clusters [J]. Rong Gu, Xiaoliang Yang, Jinshuang Yan. Journal of Parallel and Distributed Computing. 2014, No. 3
3. Hadoop Mapreduce Performance Enhancement Using In-Node Combiners [J]. Woo-Hyun Lee, Hee-Gook Jun, Hyoung-Joo Kim. International Journal of Computer Science & Information Technology (IJCSIT). 2015, No. 5
4. Similarity-Based Node Distance Exploring and Locality-Aware Shuffle Optimization for Hadoop MapReduce [C]. Jihe Wang, Danghui Wang, Meng Zhang. IEEE International Conference on Smart Cloud. 2017
5. ST-Hadoop: A MapReduce Framework for Big Spatio-Temporal Data Management [D]. Louai Alarabi. 2019
6. FASTA/Q data compressors for MapReduce-Hadoop genomics: space and time savings made easy [O]. Umberto Ferraro Petrillo, Francesco Palini, Giuseppe Cattaneo. 2021
7. Hadoop Mapreduce Performance Enhancement Using In-Node Combiners [O]. Woo-Hyun Lee, Hee-Gook Jun, Hyoung-Joo Kim. 2015
