首页> 外文期刊>Simulation >Parallel Performance Modeling using a Genetic Programming-based Error Correction Procedure
【24h】

Parallel Performance Modeling using a Genetic Programming-based Error Correction Procedure

机译:使用基于遗传程序的纠错程序进行并行性能建模

获取原文
获取原文并翻译 | 示例
       

摘要

Performance models of high performance computing (HPC) applications are important for several reasons. First, they provide insight to designers of HPC systems on the role of subsystems such as the processor or the network in determining application performance. Second, they allow HPC centers more accurately to target procurements to resource requirements. Third, they can be used to identify application performance bottlenecks and to provide insights about scalability issues. The suitability of a performance model, however, for a particular performance investigation is a function of both the accuracy and the cost of the model. A semi-empirical model previously published by the authors for an astrophysics application was shown to be inaccurate when predicting communication cost for large numbers of processors. It is hypothesized that this deficiency is due to the inability of the model adequately to capture communication contention (threshold effects) as well as other unmodeled components such as noise and I/O contention. In this paper we present a new approach to capture these unknown features to improve the predictive capabilities of the model. This approach uses a systematic model error-correction procedure that uses evolutionary algorithms to find an error correction term to augment the existing model. Four variations of this procedure were investigated and all were shown to produce better results than the original model. Successful cross-platform application of this approach showed that it adequately captures machine dependent characteristics. This approach was then successfully demonstrated for a second application, further showing its versatility.
机译:高性能计算(HPC)应用程序的性能模型很重要,原因有几个。首先,它们为HPC系统的设计人员提供了有关子系统(例如处理器或网络)在确定应用程序性能方面的作用的见解。其次,它们使HPC中心可以更准确地将采购目标对准​​资源需求。第三,它们可用于识别应用程序性能瓶颈并提供有关可伸缩性问题的见解。但是,性能模型对特定性能调查的适用性是模型准确性和成本的函数。结果表明,作者先前为天体物理学应用发布的半经验模型在预测大量处理器的通信成本时是不准确的。据推测,这种缺陷是由于该模型无法充分捕获通信争用(阈值效应)以及其他未建模的组件(如噪声和I / O争用)所致。在本文中,我们提出了一种捕获这些未知特征的新方法,以提高模型的预测能力。该方法使用系统模型的纠错程序,该程序使用进化算法来查找纠错项以扩充现有模型。研究了此程序的四个变体,所有变体均显示出比原始模型更好的结果。这种方法成功的跨平台应用表明,它可以充分捕获机器相关的特性。然后,该方法成功用于第二个应用程序,进一步证明了其多功能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号