首页> 外文会议>IEEE/ACM International Symposium on Code Generation and Optimization >LIFT: A functional data-parallel IR for high-performance GPU code generation
【24h】

LIFT: A functional data-parallel IR for high-performance GPU code generation

机译:LIFT:功能齐全的数据并行IR,用于生成高性能GPU代码

获取原文

摘要

Parallel patterns (e.g., map, reduce) have gained traction as an abstraction for targeting parallel accelerators and are a promising answer to the performance portability problem. However, compiling high-level programs into efficient low-level parallel code is challenging. Current approaches start from a high-level parallel IR and proceed to emit GPU code directly in one big step. Fixed strategies are used to optimize and map parallelism exploiting properties of a particular GPU generation leading to performance portability issues. We introduce the LIFT IR, a new data-parallel IR which encodes OpenCL-specific constructs as functional patterns. Our prior work has shown that this functional nature simplifies the exploration of optimizations and mapping of parallelism from portable high-level programs using rewrite-rules. This paper describes how LIFT IR programs are compiled into efficient OpenCL code. This is non-trivial as many performance sensitive details such as memory allocation, array accesses or synchronization are not explicitly represented in the LIFT IR. We present techniques which overcome this challenge by exploiting the pattern's high-level semantics. Our evaluation shows that the LIFT IR is flexible enough to express GPU programs with complex optimizations achieving performance on par with manually optimized code.
机译:并行模式(例如,映射,缩小)作为针对并行加速器的抽象已经获得了广泛的关注,并且是对性能可移植性问题的有希望的答案。然而,将高级程序编译成有效的低级并行代码是具有挑战性的。当前的方法是从高级并行IR开始,然后一步一步直接发出GPU代码。固定策略用于优化和映射利用特定一代GPU的性能的并行性,从而导致性能可移植性问题。我们介绍了LIFT IR,这是一种新的数据并行IR,它将OpenCL特定的构造编码为功能模式。我们之前的工作表明,这种功能性质简化了使用重写规则从可移植高级程序中对并行性进行优化和映射的探索。本文介绍如何将LIFT IR程序编译为有效的OpenCL代码。这是不平凡的,因为许多性能敏感的细节(例如内存分配,阵列访问或同步)没有在LIFT IR中明确表示。我们提出了通过利用模式的高级语义克服了这一挑战的技术。我们的评估表明,LIFT IR具有足够的灵活性,可以通过复杂的优化来表达GPU程序,从而达到与手动优化的代码相当的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号