首页> 外文会议>IEEE International Conference on Cloud Computing and Intelligent Systems >Performance modeling and optimization of MapReduce programs
【24h】

Performance modeling and optimization of MapReduce programs

机译:MapReduce程序的性能建模和优化

获取原文
获取外文期刊封面目录资料

摘要

MapReduce is a developer-friendly framework that encapsulates the underlying complexities of distributed computing. It is increasingly being used across enterprises for advanced data analytics, business intelligence, and data mining tasks. But there are two questions bothering Hadoop users: how to improve the performance of MapReduce workloads, and how to estimate the time needed to run a MapReduce job. In this paper, we provide some performance optimization techniques on the premise of workload characterization. After the cluster achieving the best performance, we further propose a modeling method to help Hadoop users estimate the execution time of MapReduce jobs. For evaluation, typical benchmarks are utilized to evaluate the accuracy of our techniques.
机译:MapReduce是一个对开发人员友好的框架,其中封装了分布式计算的基础复杂性。它在企业中越来越多地用于高级数据分析,商业智能和数据挖掘任务。但是有两个问题困扰着Hadoop用户:如何提高MapReduce工作负载的性能,以及如何估算运行MapReduce作业所需的时间。在本文中,我们在工作负载表征的前提下提供了一些性能优化技术。在集群达到最佳性能之后,我们进一步提出了一种建模方法,以帮助Hadoop用户估计MapReduce作业的执行时间。为了进行评估,典型的基准被用来评估我们技术的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号