首页> 外文期刊>ACM Transactions on Modeling and Computer Simulation >New Performance Modeling Methods for Parallel Data Processing Applications
【24h】

New Performance Modeling Methods for Parallel Data Processing Applications

机译:并行数据处理应用程序的新性能建模方法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Predicting the performance of an application running on parallel computing platforms is increasingly becoming important because of its influence on development time and resource management. However, predicting the performance with respect to parallel processes is complex for iterative and multi-stage applications. This research proposes a performance approximation approach FiM to predict the calculation time with FiM-Cal and communication time with FiM-Com of an application running on a distributed framework. FiM-Cal consists of two key components that are coupled with each other: (1) a Stochastic. Markov Model to capture non-deterministic runtime that often depends on parallel resources, e.g., number of processes, and (2) a machine-learning model that extrapolates the parameters for calibrating our Markov model when we have changes in application parameters such as dataset. Along with the parallel calculation time, parallel computing platforms consume some data transfer time to communicate among different nodes. FiM-Com consists of a simulation queuing model to quickly estimate communication time. Our new modeling approach considers different design choices along multiple dimensions, namely (i) process-level parallelism, (ii) distribution of cores on multi-processor platform, (iii) application related parameters, and (iv) characteristics of datasets. The major contribution of our prediction approach is that FiM can provide an accurate prediction of parallel processing time for the datasets that have a much larger size than that of the training datasets. We evaluate our approach with NAS Parallel Benchmarks and real iterative data processing applications. We compare the predicted results (e.g., end-to-end execution time) with actual experimental measurements on a real distributed platform. We also compare our work with an existing prediction technique based on machine learning. We rank the number of processes according to the actual and predicted results from FLM and calculate the correlation between the actual and predicted rankings. Our results show that FiM obtains a high correlation in the range of 0.80 to 0.99, which indicates considerable accuracy of our technique. Such prediction provides data analysts a useful insight of optimal configuration of parallel resources (e.g., number of processes and number of cores) and also helps system designers to investigate the impact of changes in application parameters on system performance.
机译:预测在并行计算平台上运行的应用程序的性能变得越来越重要,因为它会影响开发时间和资源管理。但是,对于迭代和多阶段应用程序,相对于并行过程预测性能很复杂。这项研究提出了一种性能近似方法FiM来预测在分布式框架上运行的应用程序的FiM-Cal计算时间和与FiM-Com的通信时间。 FiM-Cal由两个相互联系的关键组成部分组成:(1)随机指标。马尔可夫模型以捕获通常取决于并行资源(例如进程数)的非确定性运行时,以及(2)机器学习模型,当我们对应用程序参数(例如数据集)进行更改时,该模型会外推参数以校准我们的马尔可夫模型。随着并行计算时间的增加,并行计算平台会花费一些数据传输时间来在不同节点之间进行通信。 FiM-Com包含一个仿真排队模型,可快速估算通信时间。我们的新建模方法在多个维度上考虑了不同的设计选择,即(i)进程级并行性,(ii)多处理器平台上的内核分布,(iii)与应用程序相关的参数以及(iv)数据集的特征。我们的预测方法的主要贡献在于,FiM可以为尺寸比训练数据集大得多的数据集提供并行处理时间的准确预测。我们使用NAS并行基准和实际的迭代数据处理应用程序评估我们的方法。我们将预测结果(例如端到端执行时间)与真实分布式平台上的实际实验测量值进行比较。我们还将我们的工作与基于机器学习的现有预测技术进行比较。我们根据FLM的实际和预测结果对进程数进行排名,并计算实际和预测排名之间的相关性。我们的结果表明,FiM在0.80至0.99的范围内获得了很高的相关性,这表明我们的技术具有相当大的准确性。这种预测为数据分析人员提供了有关并行资源最佳配置(例如,进程数和内核数)的有用见解,还有助于系统设计人员研究应用程序参数变化对系统性能的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号