首页> 外文会议>IEEE International Conference on Computer Supported Cooperative Work in Design >A Collaborative Filtering based Approach to Performance Prediction for Parallel Applications
【24h】

A Collaborative Filtering based Approach to Performance Prediction for Parallel Applications

机译:基于协同过滤的并行应用的性能预测方法

获取原文

摘要

Parallel application jobs account for a large population in current domain of cloud computing and Big Data processing services, whose execution time can be varied greatly with different runtime configurations. For efficiently scheduling resources and services to run parallel jobs, the ability to quickly and accurately estimate the performance of parallel applications is critical. Analytic predictive models based on traditional modeling techniques such as queuing systems are difficult to construct for parallel applications, due to the high complexity lying in the structures of parallel application models. Furthermore, due to the heterogeneity of resources computing capacities with a scalable computing environment such as a cloud computing platform, performance analytic and prediction becomes increasingly difficult for parallel applications. To address this problem, in this paper we propose a collaborative filtering based approach to quickly and accurately predict the execution time of parallel applications running in heterogenous resources. Particularly, we use the widely used Apache Spark platform as the running framework for parallel applications, and propose a bounds-based performance model to improve the prediction accuracy. Through extensive simulations and experiments on real Spark clusters and two large-scale machine learning applications as well as the simple but classic WordCount sample application, we show that the proposed Collaborative Filtering based approach and bounds-based performance model can accurately estimate the performance of parallel applications.
机译:并行应用职位占云计算和大数据处理服务的当前域中的大量人口,其执行时间可以通过不同的运行时配置大大变化。为了有效调度资源和服务来运行并行作业,快速准确估计并行应用程序性能的能力至关重要。基于传统建模技术的分析预测模型,例如排队系统难以构建用于并行应用的并行应用,这是由于并行应用模型的结构的高复杂性。此外,由于资源计算能力的资源计算能力,例如云计算平台,性能分析和预测对于并行应用越来越困难。为了解决这个问题,在本文中,我们提出了一种基于协同过滤的方法来快速准确地预测在异构资源中运行的并行应用的执行时间。特别是,我们使用广泛使用的Apache Spark平台作为并行应用程序的运行框架,并提出基于界限的性能模型来提高预测精度。通过广泛的模拟和实验在真正的火花群和两个大型机器学习应用以及简单但经典的Wordcount应用程序中,我们表明所提出的基于协作过滤的方法和基于界限的性能模型可以准确估计并行性能应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号