首页> 外文期刊>ACM Transactions on Programming Languages and Systems >ScalaExtrap: Trace-Based Communication Extrapolation for SPMD Programs
【24h】

ScalaExtrap: Trace-Based Communication Extrapolation for SPMD Programs

机译:ScalaExtrap:SPMD程序的基于跟踪的通信外推

获取原文
获取原文并翻译 | 示例
           

摘要

Performance modeling for scientific applications is important for assessing potential application performance and systems procurement in high-performance computing (HPC). Recent progress on communication tracing opens up novel opportunities for communication modeling due to its lossless yet scalable trace collection. Estimating the impact of scaling on communication efficiency still remains nontrivial due to execution-time variations and exposure to hardware and software artifacts. This work contributes a fundamentally novel modeling scheme. We synthetically generate the application trace for large numbers of nodes via extrapolation from a set of smaller traces. We devise an innovative approach for topology extrapolation of single program, multiple data (SPMD) codes with stencil or mesh communication. Experimental results show that the extrapolated traces precisely reflect the communication behavior and the performance characteristics at the target scale for both strong and weak scaling applications. The extrapolated trace can subsequently be (a) replayed to assess communication requirements before porting an application, (b) transformed to autogenerate communication benchmarks for various target platforms, and (c) analyzed to detect communication inefficiencies and scalability limitations. To the best of our knowledge, rapidly obtaining the communication behavior of parallel applications at arbitrary scale with the availability of timed replay, yet without actual execution of the application, at this scale, is without precedence and has the potential to enable otherwise infeasible system simulation at the exascale level.
机译:科学应用程序的性能建模对于评估高性能计算(HPC)中潜在的应用程序性能和系统采购非常重要。由于其无损但可伸缩的跟踪收集,通信跟踪的最新进展为通信建模打开了新的机会。由于执行时间的变化以及暴露于硬件和软件工件的影响,估计扩展对通信效率的影响仍然很重要。这项工作为根本上新颖的建模方案做出了贡献。我们通过从一组较小的跟踪中进行推断来综合生成大量节点的应用程序跟踪。我们设计了一种创新的方法,可通过模板或网状通信对单个程序,多个数据(SPMD)代码进行拓扑外推。实验结果表明,无论是强缩放应用还是弱缩放应用,外推迹线都能准确反映目标规模的通信行为和性能特征。随后可以外推跟踪(a)在移植应用程序之前重放以评估通信要求,(b)转换为自动生成各种目标平台的通信基准,以及(c)进行分析以检测通信效率低下和可伸缩性限制。据我们所知,以定时重播的速度快速获得任意规模的并行应用程序的通信行为,却没有实际执行此规模的应用程序是没有先例的,并且有可能实现否则无法进行的系统仿真在亿亿级别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号