首页> 外文会议>International conference on Euro-Par >Gunther: Search-Based Auto-Tuning of MapReduce
【24h】

Gunther: Search-Based Auto-Tuning of MapReduce

机译:Gunther:基于搜索的MapReduce自动调整

获取原文

摘要

MapReduce has emerged as a very popular programming model for large-scale data analytics. Despite its industry-wide acceptance, the open source Apache™ Hadoop™ framework for MapReduce remains difficult to optimize, particularly in large-scale production environments. The vast search space defined by the hundreds of MapReduce configuration parameters and the complex interactions between them makes it time consuming to rely on manual tuning. Hence something more is needed. In this paper we evaluate approaches to the automatic tuning of Hadoop MapReduce including ones based on cost-based and machine learning models. We determine that they are inadequate and instead propose a search-based approach called Gunther for Hadoop MapReduce optimization. Gunther uses a Genetic Algorithm which is specially designed to aggressively identify parameter settings that result in near-optimal job execution time. We evaluate Gunther on two types of clusters with different resource characteristics. Our experiments demonstrate that Gunther can obtain near-optimal performance within a small number of trials (<30), outperforming existing auto-tuning'solutions and industry recommended configurations. We also describe a methodology for reducing the dimensionality of the auto-tuning problem, further improving search efficiency without sacrificing performance improvement.
机译:MapReduce已经成为大规模数据分析的一种非常流行的编程模型。尽管已为业界所接受,但MapReduce的开源Apache™Hadoop™框架仍然难以优化,尤其是在大规模生产环境中。由数百个MapReduce配置参数定义的巨大搜索空间以及它们之间的复杂交互使得依赖手动调整非常耗时。因此,还需要更多。在本文中,我们评估了Hadoop MapReduce自动调整的方法,包括基于成本和机器学习模型的方法。我们确定它们不足,而是为Hadoop MapReduce优化提出了一种名为Gunther的基于搜索的方法。 Gunther使用遗传算法,该遗传算法专门设计用于主动识别参数设置,从而使作业执行时间接近最佳。我们在具有不同资源特征的两种类型的集群上评估Gunther。我们的实验表明,Gunther在少量试验(<30)中可以获得接近最佳的性能,优于现有的“自动调整”解决方案和行业推荐的配置。我们还描述了一种方法,用于减少自动调整问题的维数,进一步提高搜索效率而又不牺牲性能的提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号