GPGPU kernel implementation and refinement using Obsidian

Joel Svensson; Koen Claessen; Mary Sheeran


Abstract

Obsidian is a domain-specific language for data-parallel programming on graphics processors (GPUs). It is embedded in the functional programming language Haskell. The user writes code using constructs familiar from Haskell (like map and reduce), recursion, and some specially designed combinators for combining GPU programs. NVIDIA CUDA code is generated from these high-level descriptions and passed to the nvcc compiler [1]. Currently, we consider only the generation of single kernels, and not their coordination.

This paper is focussed on how the user should work with Obsidian, starting with an obviously correct (or well-tested) description of the required function, and refining it by introducing constructs that give finer control of the computation on the GPU. For some combinators, this approach results in CUDA code with satisfactory performance, promising increased productivity, as the high-level descriptions are short and uncluttered. But for other combinators, the performance of the generated code is not yet satisfactory. Ways to tackle this problem, and plans to integrate Obsidian with another higher-level embedded language for GPU programming in Haskell, are briefly discussed.
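The refinement workflow described in the abstract (start from an obviously correct description, then restructure it to control how the computation is laid out) can be illustrated with a minimal sketch in plain Haskell. This is not Obsidian's API and is not taken from the paper: the names `sumSpec`, `reduceTree`, and `sumTree` are hypothetical, ordinary lists stand in for GPU arrays, and the refined version assumes a non-empty input whose length is a power of two.

```haskell
-- Illustrative sketch only: plain Haskell lists, NOT Obsidian's API.
-- All names here (sumSpec, reduceTree, sumTree) are hypothetical.

-- Step 1: an obviously correct specification, a sequential sum.
sumSpec :: [Int] -> Int
sumSpec = foldr (+) 0

-- Step 2: a refined, tree-shaped reduction. Each recursive step combines
-- the front half with the back half element-wise, so all combinations in
-- one step are independent of each other (the shape a data-parallel
-- reduction typically takes).
-- Assumption: the input is non-empty and its length is a power of two.
reduceTree :: (a -> a -> a) -> [a] -> a
reduceTree _ []  = error "reduceTree: empty input"
reduceTree _ [x] = x
reduceTree f xs  = reduceTree f (zipWith f front back)
  where
    (front, back) = splitAt (length xs `div` 2) xs

-- The refined sum, to be checked against sumSpec.
sumTree :: [Int] -> Int
sumTree = reduceTree (+)
```

Because `(+)` is associative and commutative, `sumTree` and `sumSpec` agree on every input that meets the stated precondition; for example, both give 36 on `[1..8]`. Checking the refined version against the specification in this way is the kind of step the abstract's workflow relies on.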

Bibliographic details

Source: Procedia Computer Science, 2010, Issue 1, 10 pages
Authors: Joel Svensson; Koen Claessen; Mary Sheeran
Original format: PDF
Chinese Library Classification: Computing technology, computer technology
Keywords: Data-parallel; Embedded language; GPUs; Haskell
