首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium Workshops and PhD Forum >Inferring Large-Scale Computation Behavior via Trace Extrapolation
【24h】

Inferring Large-Scale Computation Behavior via Trace Extrapolation

机译:通过跟踪外推推断大规模计算行为

获取原文

摘要

Understanding large-scale application behavior is critical for effectively utilizing existing HPC resources and making design decisions for upcoming systems. In this work we present a methodology for characterizing an MPI application's large-scale computation behavior and system requirements using information about the behavior of that application at a series of smaller core counts. The methodology finds the best statistical fit from among a set of canonical functions in terms of how a set of features that are important for both performance and energy (cache hit rates, floating point intensity, ILP, etc.) change across a series of small core counts. The statistical models for each of these application features can then be utilized to generate an extrapolated trace of the application at scale. The fidelity of the fully extrapolated traces is evaluated by comparing the results of building performance models using both the extrapolated trace along with an actual trace in order to predict application performance at using each. For two full-scale HPC applications, SPECFEM3D and UH3D, the extrapolated traces had absolute relative errors of less than 5%.
机译:了解大规模应用程序的行为对于有效利用现有HPC资源并为即将到来的系统做出设计决策至关重要。在这项工作中,我们提出了一种方法,该方法用于使用一系列较小核心数量的有关该应用程序行为的信息来表征MPI应用程序的大规模计算行为和系统需求。该方法论从一组规范函数中找到最佳的统计拟合,这取决于对一系列性能和能量(缓存命中率,浮点强度,ILP等)都重要的一组功能如何在一系列小的范围内变化核心计数。然后,可以使用这些应用程序功能中每个功能的统计模型来按比例生成应用程序的外推轨迹。通过比较使用外推迹线和实际迹线的建筑性能模型的结果来评估完全外推迹线的保真度,以便预测使用每个时的应用程序性能。对于两个完整的HPC应用SPECFEM3D和UH3D,外推迹线的绝对相对误差小于5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号