Twinned buffering: A simple and highly effective scheme for parallelization of Successive Over-Relaxation on GPUs and other accelerators


Abstract

In this paper we present a new scheme for parallelization of the Successive Over-Relaxation method for solving the Poisson equation over a 3-D volume. Our new scheme is both simple and effective, outperforming the conventional Red-Black scheme by a factor of 16 on an NVIDIA GeForce GTX 590 GPU, a factor of 11 on an NVIDIA GeForce TITAN Black GPU and a factor of 5 on an Intel Xeon Phi. The speed-up compared to the fully optimised reference implementation running on an Intel Xeon CPU is 16 times on the GTX 590, 22 times on the TITAN and 5 times on the Xeon Phi. We explain the rationale and the implementation in OpenCL and present the performance evaluation results.


Bibliographic details

Source: International Conference on High Performance Computing Simulation, 2015, 8 pages
Authors: Wim Vanderbauwhede; Tetsuya Takemi
Original format: PDF
Chinese Library Classification: General issues
Keywords: General-Purpose computation on Graphics Processing Units (GPGPU); Large Scale Scientific Computing; Parallelization of Simulation


