Conference: International conference on high performance computing

Comparing Runtime Systems with Exascale Ambitions Using the Parallel Research Kernels


Abstract

We use three Parallel Research Kernels (PRK) to compare the performance of a set of programming models: MPI1 (MPI two-sided communication), MPIOPENMP (MPI+OpenMP), MPISHM (MPI1 with MPI-3 interprocess shared memory), MPIRMA (MPI one-sided communication), SHMEM, UPC, Charm++ and Grappa. (We employ the term programming model as it is commonly used in the application community. A more accurate term is programming environment, which is the collective of the abstract programming model, the embodiment of the model in an Application Programmer Interface (API), and the runtime that implements it.) The kernels in our study, Stencil, Synch_p2p and Transpose, underlie a wide range of computational science applications. They enable direct probing of properties of programming models, especially communication and synchronization. In contrast to mini- or proxy applications, the PRK allow for rapid implementation, measurement and verification. Our experimental results show MPISHM to be the overall winner, with MPI1, MPIOPENMP and SHMEM also performing well. MPISHM and MPIOPENMP outperform the other models in the strong-scaling limit due to their effective use of shared memory and good granularity control. The non-evolutionary models Grappa and Charm++ are not competitive with the traditional models (MPI and PGAS) for two of the kernels; these models favor irregular algorithms, while the PRK considered here are regular.
