Comparison and Improvement of Hadoop MapReduce Performance Prediction Models in the Private Cloud

Abstract

Performance modeling of MapReduce applications that process large-scale data is an important problem in the optimization, evaluation, prediction, and resource scheduling of jobs on big data and cloud computing platforms. In this paper, we study the Hadoop distributed computing framework, which represents the current trend in Big Data solutions. Using the locally weighted linear regression (LWLR) and linear regression (LR) algorithms, we build three models, each based on different characteristics, to estimate the execution time of large-scale data applications running on Hadoop, and we compare and improve the three models. From experiments across different types of environments and different types of jobs, we conclude that all three models perform very well at predicting the execution time and evaluating the performance of large-scale data applications from small-scale data.
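
As a rough illustration of the two regression techniques named in the abstract, the following Python sketch fits an ordinary LR model and an LWLR model to a handful of job runs and predicts the execution time at a new input size. It is not the authors' implementation: the abstract does not state which features the three models use, so the single feature (input size), the Gaussian kernel, the bandwidth `tau`, and the function names `lr_predict` and `lwlr_predict` are all assumptions made for this example, and the sample data is invented.

```python
import numpy as np


def _with_intercept(X):
    """Prepend a column of ones so the model can learn an intercept."""
    X = np.asarray(X, dtype=float)
    return np.hstack([np.ones((X.shape[0], 1)), X])


def lr_predict(x_query, X, y):
    """Ordinary least-squares linear regression: one global fit."""
    Xb = _with_intercept(X)
    y = np.asarray(y, dtype=float)
    theta = np.linalg.lstsq(Xb, y, rcond=None)[0]
    xq = np.concatenate([[1.0], np.atleast_1d(np.asarray(x_query, dtype=float))])
    return float(xq @ theta)


def lwlr_predict(x_query, X, y, tau=1.0):
    """Locally weighted linear regression: a separate weighted fit per query.

    Training points close to x_query get larger weights via a Gaussian
    kernel with bandwidth tau (an assumed kernel choice, not from the paper).
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    x_query = np.atleast_1d(np.asarray(x_query, dtype=float))
    Xb = _with_intercept(X)
    xq = np.concatenate([[1.0], x_query])
    # Gaussian weight for each training point, based on distance to the query.
    sq_dist = np.sum((X - x_query) ** 2, axis=1)
    w = np.exp(-sq_dist / (2.0 * tau ** 2))
    # Weighted normal equations: (Xb^T W Xb) theta = Xb^T W y.
    XtW = Xb.T * w
    theta = np.linalg.lstsq(XtW @ Xb, XtW @ y, rcond=None)[0]
    return float(xq @ theta)


if __name__ == "__main__":
    # Invented sample: input size (GB) versus measured execution time (s).
    sizes = [[1.0], [2.0], [3.0], [4.0]]
    times = [5.0, 7.0, 9.0, 11.0]
    print("LR estimate:  ", lr_predict([2.5], sizes, times))
    print("LWLR estimate:", lwlr_predict([2.5], sizes, times, tau=1.0))
```

On exactly linear data the two estimates coincide. They diverge when the relationship between input size and execution time is only locally linear. In this sketch, a query far outside the training range receives near-zero weights from every sample, so extrapolating from small-scale runs to large-scale inputs would need a bandwidth chosen with that distance in mind.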

Bibliographic details

Source
Asia-Pacific Services Computing Conference | 2016 | pp. 77-91 | 15 pages
Conference location: Zhangjiajie (CN)
Authors
Nini Wang; Jian Yang; Zhihui Lu; Xiaoyan Li; Jie Wu;
Author affiliations

School of Computer Science Fudan University Shanghai 200433 China;

Engineering Research Center of Cyber Security Auditing and Monitoring Ministry of Education Shanghai 200433 China;

Original format: PDF
Keywords
Big data; Hadoop; Private cloud; MapReduce; Performance prediction model; Job estimation
