首页> 外文会议>Asia-Pacific services computing conference >Comparison and Improvement of Hadoop MapReduce Performance Prediction Models in the Private Cloud
【24h】

Comparison and Improvement of Hadoop MapReduce Performance Prediction Models in the Private Cloud

机译:私有云中Hadoop MapReduce性能预测模型的比较和改进

获取原文

摘要

Performance modeling for MapReduce applications with large-scale data is a very important issue in the study of optimization, evaluation, prediction and resource scheduling of the jobs over big data and cloud computing platforms. In this paper, we study the Hadoop distributed computing framework, which is the current trend of Big Data solutions. We use the locally weighted linear regression (LWLR) algorithm and linear regression (LR) algorithm to establish three kinds of computing models based on different characteristics to estimate the execution time of the applications that have large-scale data and run on the Hadoop framework, and at the same time we make comparison and improvement to the three models. By building different types of experimental environments, and running different types of jobs, we can draw a conclusion that all the three models have very good results in predicting the execution time and evaluating the performance of large-scale data applications with small-scale data.
机译:在具有大数据和云计算平台的作业的优化,评估,预测和资源调度的研究中,具有大规模数据的MapReduce应用程序的性能建模是一个非常重要的问题。在本文中,我们研究了Hadoop分布式计算框架,这是大数据解决方案的当前趋势。我们使用局部加权线性回归(LWLR)算法和线性回归(LR)算法,根据不同的特征建立三种计算模型,以估算具有大规模数据并在Hadoop框架上运行的应用程序的执行时间,同时我们对这三个模型进行了比较和改进。通过构建不同类型的实验环境并运行不同类型的作业,我们可以得出结论,这三种模型在预测执行时间和评估具有小规模数据的大规模数据应用程序的性能方面均具有非常好的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号