首页> 外文会议>IEEE International Conference on Industrial Informatics >Performance evaluation and tuning for MapReduce computing in Hadoop distributed file system
【24h】

Performance evaluation and tuning for MapReduce computing in Hadoop distributed file system

机译:Hadoop分布式文件系统中MapReduce计算的性能评估和调整

获取原文

摘要

This paper proposes a method to facilitate the identification process for a set of configuration parameters to achieve the optimal performance with respect to a benchmark program in HDFS in an automated manner. Performance optimization of Hadoop processes is a tedious yet challenging problem due to the complexity of the systems organization with an extensive list of configuration parameters to be considered. An Automated Benchmarking Configuration Method (ABCM) is developed in this work to facilitate the identification process for the set of configuration parameters that minimizes the execution time of a benchmark, namely TestDFSIO Write and Read in particular. A two-phased configuration parameters selection process with a simple sampling technique is proposed in order to mediate the exponential computation time otherwise. By using the proposed technique, we have automatically found the sets of top five selected optimal configuration parameters that reduced the average execution time by 32% compared to the execution time with the default set of Hadoop configuration parameters.
机译:本文提出了一种方法,该方法可简化针对一组配置参数的识别过程,以相对于HDFS中的基准程序自动实现最佳性能。由于系统组织的复杂性以及要考虑的大量配置参数,因此Hadoop流程的性能优化是一个繁琐而具有挑战性的问题。在这项工作中,开发了一种自动基准配置方法(ABCM),以简化对一组配置参数的识别过程,从而最大程度地减少了基准(特别是TestDFSIO写入和读取)的执行时间。为了调解指数计算时间,提出了一种采用简单采样技术的两阶段配置参数选择过程。通过使用所提出的技术,我们自动找到了前五名选定的最佳配置参数集,与使用默认Hadoop配置参数集的执行时间相比,这些参数将平均执行时间减少了32%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号