A Collaborative Filtering based Approach to Performance Prediction for Parallel Applications

机译：基于协同过滤的并行应用的性能预测方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Parallel application jobs account for a large population in current domain of cloud computing and Big Data processing services, whose execution time can be varied greatly with different runtime configurations. For efficiently scheduling resources and services to run parallel jobs, the ability to quickly and accurately estimate the performance of parallel applications is critical. Analytic predictive models based on traditional modeling techniques such as queuing systems are difficult to construct for parallel applications, due to the high complexity lying in the structures of parallel application models. Furthermore, due to the heterogeneity of resources computing capacities with a scalable computing environment such as a cloud computing platform, performance analytic and prediction becomes increasingly difficult for parallel applications. To address this problem, in this paper we propose a collaborative filtering based approach to quickly and accurately predict the execution time of parallel applications running in heterogenous resources. Particularly, we use the widely used Apache Spark platform as the running framework for parallel applications, and propose a bounds-based performance model to improve the prediction accuracy. Through extensive simulations and experiments on real Spark clusters and two large-scale machine learning applications as well as the simple but classic WordCount sample application, we show that the proposed Collaborative Filtering based approach and bounds-based performance model can accurately estimate the performance of parallel applications.

机译：并行应用职位占云计算和大数据处理服务的当前域中的大量人口，其执行时间可以通过不同的运行时配置大大变化。为了有效调度资源和服务来运行并行作业，快速准确估计并行应用程序性能的能力至关重要。基于传统建模技术的分析预测模型，例如排队系统难以构建用于并行应用的并行应用，这是由于并行应用模型的结构的高复杂性。此外，由于资源计算能力的资源计算能力，例如云计算平台，性能分析和预测对于并行应用越来越困难。为了解决这个问题，在本文中，我们提出了一种基于协同过滤的方法来快速准确地预测在异构资源中运行的并行应用的执行时间。特别是，我们使用广泛使用的Apache Spark平台作为并行应用程序的运行框架，并提出基于界限的性能模型来提高预测精度。通过广泛的模拟和实验在真正的火花群和两个大型机器学习应用以及简单但经典的Wordcount应用程序中，我们表明所提出的基于协作过滤的方法和基于界限的性能模型可以准确估计并行性能应用程序。

著录项

来源
《IEEE International Conference on Computer Supported Cooperative Work in Design》|2017年|594 p. :|共6页
会议地点
作者
Qingshi Shao; Li Pan; Shijun Liu; Xinyan Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TB21-532;
关键词
parallel application; Big Data; performance prediction; collaborative filtering; Spark;

机译：并行应用;大数据;性能预测;协作过滤;火花;

相似文献

外文文献
中文文献
专利

1. Neighbor Selection and Weighting in User-Based Collaborative Filtering: A Performance Prediction Approach [J] . ALEJANDRO BELLOGIN, PABLO CASTELLS, IVAN CANTADOR ACM transactions on the web . 2014,第2期

机译：基于用户的协同过滤中的邻居选择和权重：一种性能预测方法
2. A Content-Boosted Collaborative Filtering Approach for Movie Recommendation Based on Local and Global Similarity and Missing Data Prediction [J] . GOZDE OZBAL, HlLAL KARAMAN, FERDA N. ALPASLAN The Computer journal . 2011,第9期

机译：基于局部和全局相似度以及数据丢失预测的电影推荐内容增强协同过滤方法
3. A Content-Boosted Collaborative Filtering Approach for Movie Recommendation Based on Local and Global Similarity and Missing Data Prediction [J] . Gözde Özbal, Hılal Karaman, Ferda N. Alpaslan Computer Journal, The . 2011,第9期

机译：基于局部和全局相似度以及数据丢失预测的电影推荐内容增强协同过滤方法
4. A Collaborative Filtering based Approach to Performance Prediction for Parallel Applications [C] . Qingshi Shao, Li Pan, Shijun Liu, IEEE International Conference on Computer Supported Cooperative Work in Design . 2017

机译：基于协同过滤的并行应用的性能预测方法
5. Enhancing Collaborative Filtering-Based Rating-Prediction by Discovering and Incorporating User Concerns from User Reviews [D] . Pradhan, Ligaj. 2017

机译：通过发现和纳入用户评论中的用户关注点来增强基于协作过滤的评分预测
6. A Personalized QoS Prediction Approach for CPS Service Recommendation Based on Reputation and Location-Aware Collaborative Filtering [O] . Li Kuang, Long Yu, Lan Huang, 2018

机译：基于信誉和位置感知协同过滤的CPS服务推荐个性化QoS预测方法
7. Neighbor selection and weighting in user-based collaborative filtering: A performance prediction approach [O] . Bellogín, Alejandro, Castells, Pablo, Cantador, Iván 2014

机译：基于用户的协同过滤中的邻居选择和加权：性能预测方法

A Collaborative Filtering based Approach to Performance Prediction for Parallel Applications

摘要

著录项

相似文献

相关主题

期刊订阅