首页> 外文会议>International Conference on High Performance Computing and Simulation >Roofline Scaling Trajectories: A Method for Parallel Application and Architectural Performance Analysis
【24h】

Roofline Scaling Trajectories: A Method for Parallel Application and Architectural Performance Analysis

机译:Roadline缩放轨迹:一种并行应用和架构性能分析的方法

获取原文

摘要

The end of Dennard scaling signaled a shift in HPC supercomputer architectures from systems built from single- core processor architectures to systems built from multicore and eventually manycore architectures. This transition substantially complicated performance optimization and analysis as new programming models were created, new scaling methodologies deployed, and on-chip contention became a bottleneck to performance. Existing distributed memory performance models like logP and logGP were unable to capture this contention. The Roofline model was created to address this contention and its interplay with locality. However, to date, the Roofline model has focused on full-node concurrency. In this paper, we extend the Roofline model to capture the effects of concurrency on data locality and on-chip contention. We demonstrate the value of this new technique by evaluating the NAS parallel benchmarks on both multicore and manycore architectures under both strong-and weak-scaling regimes. In order to quantify the interplay between programming model and locality, we evaluate scaling under both the OpenMP and flat MPI programming models.
机译:Dennard缩放的末尾通过从单核处理器架构构建的系统的HPC超级计算机架构中的转变为来自Multicore和最终多核体系结构构建的系统。这种过渡基本上复杂的性能优化和分析是创建了新的编程模型,部署了新的缩放方法,片上争用成为表现的瓶颈。现有的分布式内存性能模型如logp和loggp无法捕获此争用。创建屋顶模型,以解决此争用及其与地方的相互作用。但是,迄今为止,Royline模型集中在全节点并发上。在本文中,我们扩展了屋顶模型,以捕获并发性对数据位置和片上争用的影响。我们通过在强大和弱缩小制度下评估多核和多核架构的NAS并行基准来展示这种新技术的价值。为了量化编程模型和局部性之间的相互作用,我们在OpenMP和Flat MPI编程模型下评估缩放。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号