Asia-Pacific Conference on Computer Aided System Engineering

Automatic Parallelization of GPU Applications Using OpenCL



Abstract

Graphics Processing Units (GPUs) have been successfully used to accelerate scientific applications thanks to their computational power and the availability of programming languages that make writing scientific applications for GPUs more approachable. However, since the GPU programming model requires offloading all data to GPU memory, an application's memory footprint is limited by the size of the GPU memory. Multi-GPU systems can make such memory-limited problems tractable by distributing the computation and data among the available GPUs. Applications written to run on single-GPU systems can be parallelized (i) at runtime, through an environment that captures memory operations and kernel calls and distributes them among the available GPUs, or (ii) at compile time, through a pre-compiler that transforms the application so that its data and computation are decomposed among the available GPUs. In this paper we propose a framework and implement a tool that transforms an OpenCL application written for single-GPU systems into one that runs on multi-GPU systems. Based on data-dependency and data-usage analysis, the application is transformed to decompose its data and computation among the available GPUs. To reduce data transfer overhead, computation-communication overlapping techniques are employed. We tested our tool on two applications with different data transfer requirements: for the application with no data transfer requirements, a linear speedup is achieved, while for the application with data transfers, computation-communication overlapping reduces the communication overhead by 40%.
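
The paper's tool performs this transformation automatically; as a rough illustration only, the following is a minimal hand-written C/OpenCL sketch of the kind of decomposition described above: the global index space of a kernel and its buffers are split evenly across all available GPUs, and the host-to-device writes are enqueued non-blocking (CL_FALSE) so that a transfer to one device can overlap with computation already running on the others. The vecadd kernel, the host array names, and the even-split assumption are illustrative choices, not taken from the paper.

/* Hypothetical sketch: decompose a 1D OpenCL kernel across all GPUs of one
 * platform.  Identifiers (the vecadd kernel, arrays a/b/c) are illustrative. */
#include <CL/cl.h>
#include <stdio.h>
#include <stdlib.h>

#define N (1 << 24)

int main(void) {
    cl_platform_id platform;
    clGetPlatformIDs(1, &platform, NULL);

    cl_uint ndev = 0;
    clGetDeviceIDs(platform, CL_DEVICE_TYPE_GPU, 0, NULL, &ndev);
    cl_device_id *devs = malloc(ndev * sizeof(cl_device_id));
    clGetDeviceIDs(platform, CL_DEVICE_TYPE_GPU, ndev, devs, NULL);

    cl_context ctx = clCreateContext(NULL, ndev, devs, NULL, NULL, NULL);

    const char *src =
        "__kernel void vecadd(__global const float *a, __global const float *b,"
        "                     __global float *c) {"
        "    size_t i = get_global_id(0);"
        "    c[i] = a[i] + b[i];"
        "}";
    cl_program prog = clCreateProgramWithSource(ctx, 1, &src, NULL, NULL);
    clBuildProgram(prog, ndev, devs, NULL, NULL, NULL);

    float *a = malloc(N * sizeof(float));
    float *b = malloc(N * sizeof(float));
    float *c = malloc(N * sizeof(float));
    for (size_t i = 0; i < N; i++) { a[i] = (float)i; b[i] = 2.0f * i; }

    cl_command_queue *q = malloc(ndev * sizeof(cl_command_queue));
    size_t chunk = N / ndev;                 /* assume N divides evenly */

    for (cl_uint d = 0; d < ndev; d++) {
        size_t off = d * chunk;
        size_t bytes = chunk * sizeof(float);
        q[d] = clCreateCommandQueue(ctx, devs[d], 0, NULL);

        /* Each device allocates and receives only its slice of the data. */
        cl_mem da = clCreateBuffer(ctx, CL_MEM_READ_ONLY,  bytes, NULL, NULL);
        cl_mem db = clCreateBuffer(ctx, CL_MEM_READ_ONLY,  bytes, NULL, NULL);
        cl_mem dc = clCreateBuffer(ctx, CL_MEM_WRITE_ONLY, bytes, NULL, NULL);

        /* Non-blocking writes (CL_FALSE): the transfer to device d can overlap
         * with kernels already enqueued on the other devices. */
        clEnqueueWriteBuffer(q[d], da, CL_FALSE, 0, bytes, a + off, 0, NULL, NULL);
        clEnqueueWriteBuffer(q[d], db, CL_FALSE, 0, bytes, b + off, 0, NULL, NULL);

        cl_kernel k = clCreateKernel(prog, "vecadd", NULL);
        clSetKernelArg(k, 0, sizeof(cl_mem), &da);
        clSetKernelArg(k, 1, sizeof(cl_mem), &db);
        clSetKernelArg(k, 2, sizeof(cl_mem), &dc);

        /* The device's global work size is just its chunk of the index space. */
        clEnqueueNDRangeKernel(q[d], k, 1, NULL, &chunk, NULL, 0, NULL, NULL);
        clEnqueueReadBuffer(q[d], dc, CL_FALSE, 0, bytes, c + off, 0, NULL, NULL);
    }

    for (cl_uint d = 0; d < ndev; d++) clFinish(q[d]);   /* wait for all devices */
    printf("c[0]=%f c[N-1]=%f\n", c[0], c[N - 1]);
    return 0;
}

Because each device holds only its own slice of the buffers, the combined footprint can exceed a single GPU's memory, which is the memory-limitation benefit the abstract refers to; an actual tool would additionally have to handle partitions that do not divide evenly and kernels whose memory accesses cross partition boundaries.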
