首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >Effective sampling-driven performance tools for GPU-accelerated supercomputers
【24h】

Effective sampling-driven performance tools for GPU-accelerated supercomputers

机译:用于GPU加速超级计算机的有效采样驱动的性能工具

获取原文
获取外文期刊封面目录资料

摘要

Performance analysis of GPU-accelerated systems requires a system-wide view that considers both CPU and GPU components. In this paper, we describe how to extend system-wide, sampling-based performance analysis methods to GPU-accelerated systems. Since current GPUs do not support sampling, our implementation required careful coordination of instrumentation-based performance data collection on GPUs with sampling-based methods employed on CPUs. In addition, we also introduce a novel technique for analyzing systemic idleness in CPU/GPU systems. We demonstrate the effectiveness of our techniques with application case studies on Titan and Keeneland. Some of the highlights of our case studies are: 1) we improved performance for LULESH 1.0 by 30%, 2) we identified a hardware performance problem on Keeneland, 3) we identified a scaling problem in LAMMPS derived from CUDA initialization, and 4) we identified a performance problem that is caused by GPU synchronization operations that suffer delays due to blocking system calls.
机译:GPU加速系统的性能分析需要一个系统范围的视图,用于考虑CPU和GPU组件。在本文中,我们介绍了如何将系统范围的,采样的基于样本的性能分析方法扩展到GPU加速系统。由于目前的GPU不支持采样,我们的实现需要仔细协调GPU上的基于仪器的性能数据收集,并采用基于CPU的方法。此外,我们还介绍了一种用于分析CPU / GPU系统的系统闲置的新技术。我们展示了技术与泰坦和克莱恩兰的应用案例研究的有效性。我们案例研究的一些亮点是:1)我们改善了Lulesh 1.0的表现为30%,2)我们确定了Keeneland的硬件性能问题,3)我们确定了来自CUDA初始化的LAMMP中的缩放问题,4)我们确定了由GPU同步操作引起的性能问题,这些操作由于阻塞系统调用而受到延迟。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号