首页> 外文期刊>International journal of parallel programming >Towards High-Performance Code Generation for Multi-GPU Clusters Based on a Domain-Specific Language for Algorithmic Skeletons
【24h】

Towards High-Performance Code Generation for Multi-GPU Clusters Based on a Domain-Specific Language for Algorithmic Skeletons

机译:基于算法骨架的域特定语言,对多GPU集群的高性能代码生成

获取原文
获取原文并翻译 | 示例
           

摘要

In earlier work, we defined a domain-specific language (DSL) with the aim to provide an easy-to-use approach for programming multi-core and multi-GPU clusters. The DSL incorporates the idea of utilizing algorithmic skeletons, which are well-known patterns for parallel programming, such as map and reduce. Based on the chosen skeleton, a user-defined function can be applied to a data structure in parallel with the main advantage that the user does not have to worry about implementation details. So far, we had only implemented a generator for multi-core clusters and in this paper we present and evaluate two prototypes of generators for multi-GPU clusters, which are based on OpenACC and CUDA. We have evaluated the approach with four benchmark applications. The results show that the generation approach leads to execution times, which are on par with an alternative library implementation.
机译:在早期的工作中,我们定义了一种特定于域的语言(DSL),目的是为编程多核和多GPU集群提供易于使用的方法。 DSL包含利用算法骨架的想法,这些骨架是用于并行编程的众所周知的模式,例如MAP和REAFE。基于所选择的骨架,可以与用户不必担心实现细节的主要优点来将用户定义的函数并行应用于数据结构。到目前为止,我们只为多核集群实施了一个发电机,并且在本文中,我们展示并评估了用于多GPU集群的两个发电机原型,基于OpenACC和CUDA。我们已经评估了具有四个基准应用的方法。结果表明,生成方法导致执行时间,与替代图书馆实现有关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号