首页> 外文会议>International conference on high performance computing workshops >Evaluating Performance Portability of Accelerator Programming Models using SPEC ACCEL 1.2 Benchmarks
【24h】

Evaluating Performance Portability of Accelerator Programming Models using SPEC ACCEL 1.2 Benchmarks

机译:使用SPEC ACCEL 1.2基准评估加速器编程模型的性能可移植性

获取原文

摘要

As heterogeneous architectures are becoming mainstream for HPC systems, application programmers are looking for programming model implementations that offer both performance and portability across platforms. Two directive-based programming models for accelerator programming that aim at doing this are OpenMP 4/4.5 and Ope-nACC. Many users want to know the difference between these two programming models, the state of their implementations, how to use them, and evaluate how suitable they are for their applications. The Standard Performance Evaluation Corporation (SPEC) ACCEL benchmarks, developed by the SPEC High Performance Group (HPG), recently released SPEC ACCEL 1.2 benchmark suite to help the evaluation of OpenCL, OpenMP 4.5 and OpenACC on different platforms. In this paper we present our preliminary results that evaluates OpenMP 4.5 and OpenACC on a variety of accelerator-based systems: POWER9 with NVIDIA V100 GPUs (Summit), Intel Xeon Phi 7230 (Percival), and AMD Bulldozer Opteron with NVIDIA K20x (Titan). Comparing these benchmarks on different systems gives us insight into the support for OpenMP and OpenACC and their execution times provide insights about their quality of implementations provided by different vendors. We also compare best of OpenMP and OpenACC to see if a particular programming model favors a particular type of benchmark kernel.
机译:随着异构体系结构成为HPC系统的主流,应用程序程序员正在寻找能够在平台之间提供性能和可移植性的编程模型实现。为此目的,针对加速器编程的两个基于指令的编程模型是OpenMP 4 / 4.5和Ope-nACC。许多用户想了解这两种编程模型之间的区别,实现的状态,如何使用它们以及评估它们是否适合其应用程序。由SPEC高性能组织(HPG)开发的标准性能评估公司(SPEC)ACCEL基准,最近发布了SPEC ACCEL 1.2基准套件,以帮助在不同平台上评估OpenCL,OpenMP 4.5和OpenACC。在本文中,我们介绍了初步结果,该结果在各种基于加速器的系统上评估OpenMP 4.5和OpenACC:配备NVIDIA V100 GPU的POWER9(Summit),英特尔至强融核7230(Percival)和具有NVIDIA K20x(Titan)的AMD Bulldozer Opteron 。在不同系统上比较这些基准,可以使我们深入了解对OpenMP和OpenACC的支持,它们的执行时间可以洞察不同供应商提供的实施质量。我们还比较了OpenMP和OpenACC的最佳性能,以查看特定的编程模型是否偏向于特定类型的基准内核。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号