首页> 外文会议>IEEE Student Conference on Research and Development >Performance Analysis of the Effect of a Combiner on a MapReduce Job
【24h】

Performance Analysis of the Effect of a Combiner on a MapReduce Job

机译:Combiner对MapReduce工作的效果分析

获取原文

摘要

MapReduce has been widely deployed as the most efficient framework for big data processing due to its ability to run on commodity hardware as well as the ability to automatically and effectively manage parallel execution of tasks. During the shuffle phase, a lot of data traffic is generated which consumes a lot of bandwidth and in turn, leads to performance degradation. Many efforts have been made to reduce the data traffic during the shuffle phase, with the common one being the use of a combiner function which is default in the Hadoop framework. This paper presents a performance analysis of the effect of a combiner function on the reduce times and reduce shuffle bytes while varying the number of reduce tasks. The results of the analysis show that the combiner significantly reduces the reduce times as well as the reduce shuffle bytes.
机译:MapReduce已被广泛部署为大数据处理的最有效框架,因为它可以在商品硬件上运行的能力以及自动和有效地管理并行执行任务的能力。在Shuffle阶段,生成大量数据流量,这消耗了大量带宽,然后又导致性能下降。已经进行了许多努力来减少随机阶段期间的数据流量,其中一个是在Hadoop框架中使用默认的组合功能。本文介绍了组合器功能对缩减时间的效果的性能分析,并在改变减少任务的数量时减少洗牌字节。分析结果表明,组合器显着降低了缩小时间以及减少洗牌字节。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号