Support OpenCL 2.0 Compiler on LLVM for PTX Simulators

Yang Chun-Chieh; Wang Shao-Chung; Hsu Min-Yi; Chang Yuan-Ming; Hwang Yuan-Shin; Lee Jenq-Kuen

首页> 外文期刊>Journal of signal processing systems for signal, image, and video technology >Support OpenCL 2.0 Compiler on LLVM for PTX Simulators

【24h】

Support OpenCL 2.0 Compiler on LLVM for PTX Simulators

机译：在LLVM上为PTX模拟器支持OpenCL 2.0编译器

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Heterogeneous systems that consist of multiple CPUs and GPUs for high-performance computing are becoming increasingly popular, and OpenCL (Open Computing Language) provides a framework for writing programs that can be executed across heterogeneous devices. Compared with OpenCL 1.2, the new features of OpenCL 2.0 provide developers with better expressive power for programming heterogeneous computing environments. Currently, gem5-gpu, which includes gem5 and GPGPU-Sim, can offer an experimental simulation environment for OpenCL. In gem5-gpu, gem5 only supports CUDA, although GPGPU-Sim can support OpenCL by compiling an OpenCL kernel code to PTX code using real GPU drivers. However, this compilation flow in GPGPU-Sim can only support up to OpenCL 1.2. OpenCL 2.0 provides new features such as workgroup built-in functions, extended atomic built-in functions, and device-side enqueue. To support OpenCL 2.0, the compiler must be extended to enable the compilation of OpenCL 2.0 kernel code to PTX code. In this paper, the proposed compiler is modified from the low level virtual machine (LLVM) compiler to extend such features to enhance the emulator to support OpenCL 2.0. The proposed compiler creates local buffers for each workgroup to enable workgroup built-in functions and adds atomic built-in functions with memory order and memory scope for OpenCL 2.0 in NVPTX. Furthermore, the APIs available in CUDA are utilized to implement the OpenCL 2.0 device-side enqueue kernel and compilation schemes in Clang are revised. The AMD APP SDK 3.0 and NTU OpenCL benchmarks are used to verify that the proposed compiler can support the features of OpenCL 2.0.

机译：由多个CPU和GPU组成的用于高性能计算的异构系统正变得越来越流行，OpenCL（开放计算语言）提供了一个框架，可用于编写可在异构设备上执行的程序。与OpenCL 1.2相比，OpenCL 2.0的新功能为开发人员提供了对异构计算环境进行编程的更好的表达能力。目前，包含gem5和GPGPU-Sim的gem5-gpu可以为OpenCL提供实验性的仿真环境。在gem5-gpu中，gem5仅支持CUDA，尽管GPGPU-Sim可以通过使用真正的GPU驱动程序将OpenCL内核代码编译为PTX代码来支持OpenCL。但是，GPGPU-Sim中的此编译流程最多只能支持OpenCL 1.2。 OpenCL 2.0提供了新功能，例如工作组内置功能，扩展的原子内置功能和设备端入队。为了支持OpenCL 2.0，必须扩展编译器以启用将OpenCL 2.0内核代码编译为PTX代码。在本文中，从低层虚拟机（LLVM）编译器修改了所建议的编译器，以扩展此类功能，以增强仿真器以支持OpenCL 2.0。拟议的编译器为每个工作组创建本地缓冲区以启用工作组内置功能，并为NVPTX中的OpenCL 2.0添加原子内置功能以及内存顺序和内存范围。此外，CUDA中可用的API用于实现OpenCL 2.0设备端队列内核，并修改了Clang中的编译方案。 AMD APP SDK 3.0和NTU OpenCL基准用于验证建议的编译器可以支持OpenCL 2.0的功能。

著录项

来源
《Journal of signal processing systems for signal, image, and video technology》 |2019年第4期|261-271|共11页
作者
Yang Chun-Chieh; Wang Shao-Chung; Hsu Min-Yi; Chang Yuan-Ming; Hwang Yuan-Shin; Lee Jenq-Kuen;
展开▼
作者单位

Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan;

Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan;

Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan;

Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan;

Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei, Taiwan;

Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
OpenCL; Gem5-gpu; LLVM; Libclc; PTX;

机译：OpenCL;Gem5-gpu;LLVM;Libclc;PTX;

相似文献

外文文献
中文文献
专利

1. LLVM-based automation of memory decoupling for OpenCL applications on FPGAs [J] . Purkayastha Arnab A., Rogers Samuel, Shiddibhavi Suhas A., Microprocessors and microsystems . 2020,第Feba期

机译：FPGA上针对OpenCL应用程序的基于LLVM的内存去耦自动化
2. Enabling PoCL-based runtime frameworks on the HSA for OpenCL 2.0 support [J] . Yuan-Ming Chang, Shao-Chung Wang, Chun-Chieh Yang, Journal of systems architecture . 2017,第期

机译：在HSA上启用基于POCL的运行时框架，用于OpenCL 2.0支持
3. Predicting HPC parallel program performance based on LLVM compiler [J] . Zhang Weizhe, Hao Meng, Snir Marc Cluster computing . 2017,第2期

机译：基于LLVM编译器预测HPC并行程序性能
4. OpenCL 2.0 Compiler Adaptation on LLVM for PTX Simulators [C] . Chun-Chieh Yang, Shao-Chung Wang, Min-Yi Hsu, International Workshop on Embedded Multicore Systems . 2017

机译：用于PTX模拟器的LLVM上的OpenCL 2.0编译器适应
5. C# compiler extension to support the Object Constraint Language version 2.0 [D] . Arnold, David 2004

机译：C＃编译器扩展，以支持对象约束语言2.0版
6. ccPDB 2.0: an updated version of datasets created and compiled from Protein Data Bank [O] . Piyush Agrawal, Sumeet Patiyal, Rajesh Kumar, 2019

机译：ccPDB 2.0：从Protein Data Bank创建和编译的数据集的更新版本
7. SIMD Instructions Support in LLVM Compiler [O] . Šnobl Pavel 2014

机译：LLVM编译器中的SIMD指令支持

Support OpenCL 2.0 Compiler on LLVM for PTX Simulators

摘要

著录项

相似文献

相关主题

期刊订阅