A block-asynchronous relaxation method for graphics processing units

Hartwig Anzt; Stanimire Tomov; Jack Dongarra; Vincent Heuveline

首页> 外文期刊>Journal of Parallel and Distributed Computing >A block-asynchronous relaxation method for graphics processing units

【24h】

A block-asynchronous relaxation method for graphics processing units

机译：图形处理单元的块异步松弛方法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we analyze the potential of asynchronous relaxation methods on Graphics Processing Units (CPUs). We develop asynchronous iteration algorithms in CUDA and compare them with parallel implementations of synchronous relaxation methods on CPU- or GPU-based systems. For a set of test matrices from UFMC we investigate convergence behavior, performance and tolerance to hardware failure. We observe that even for our most basic asynchronous relaxation scheme, the method can efficiently leverage the CPUs computing power and is, despite its lower convergence rate compared to the Gauss-Seidel relaxation, still able to provide solution approximations of certain accuracy in considerably shorter time than Gauss-Seidel running on CPUs- or GPU-based Jacobi. Hence, it overcompensates for the slower convergence by exploiting the scalability and the good fit of the asynchronous schemes for the highly parallel GPU architectures. Further, enhancing the most basic asynchronous approach with hybrid schemes-using multiple iterations within the "subdomain" handled by a GPU thread block-we manage to not only recover the loss of global convergence but often accelerate convergence of up to two times, while keeping the execution time of a global iteration practically the same. The combination with the advantageous properties of asynchronous iteration methods with respect to hardware failure identifies the high potential of the asynchronous methods for Exascale computing.

机译：在本文中，我们分析了图形处理单元（CPU）上异步松弛方法的潜力。我们在CUDA中开发异步迭代算法，并将其与基于CPU或GPU的系统上同步松弛方法的并行实现进行比较。对于UFMC的一组测试矩阵，我们研究了收敛行为，性能和对硬件故障的耐受性。我们观察到，即使对于我们最基本的异步松弛方案，该方法也可以有效地利用CPU的计算能力，尽管与高斯-塞德尔松弛相比其收敛速度较低，但仍能够在相当短的时间内提供一定精度的解决方案近似值而不是在基于CPU或GPU的Jacobi上运行的Gauss-Seidel。因此，它通过利用高度并行GPU架构的异步方案的可伸缩性和良好适应性来补偿较慢的收敛。此外，通过混合方案增强最基本的异步方法-使用GPU线程块处理的“子域”内的多次迭代-我们不仅设法恢复全局收敛性的损失，而且还经常加速收敛两次，同时保持全局迭代的执行时间几乎相同。异步迭代方法在硬件故障方面的优势与优势相结合，确定了异步方法在Exascale计算中的巨大潜力。

著录项

来源
《Journal of Parallel and Distributed Computing》 |2013年第12期|1613-1626|共14页
作者
Hartwig Anzt; Stanimire Tomov; Jack Dongarra; Vincent Heuveline;
展开▼
作者单位

Karlsruhe Institute of Technology, Germany;

University of0 Tennessee Knoxville, USA;

University of0 Tennessee Knoxville, USA,Oak Ridge National Laboratory, USA, University of Manchester, UK;

Karlsruhe Institute of Technology, Germany;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Asynchronous relaxation; Chaotic iteration; Graphics processing units (CPUs); Jacobi method;

机译：异步放松;混沌迭代图形处理单元（CPU）;雅可比法;

相似文献

外文文献
中文文献
专利

1. Efficient 2D and 3D watershed on graphics processing unit: block-asynchronous approaches based on cellular automata [J] . Pablo Quesada-Barriuso, Dora B. Heras, Francisco Arguello Computers and Electrical Engineering . 2013,第8期

机译：图形处理单元上的有效2D和3D分水岭：基于元胞自动机的块异步方法
2. A distributed parallel multiple-relaxation-time lattice Boltzmann method on general-purpose graphics processing units for the rapid and scalable computation of absolute permeability from high-resolution 3D micro-CT images [J] . F. O. Alpak, F. Gray, N. Saxena, Computational Geosciences . 2018,第3期

机译：通用图形处理单元上的分布式并行多松弛时间点阵玻尔兹曼方法，可从高分辨率3D micro-CT图像快速，可扩展地计算绝对渗透率
3. Simulations of flow instability in three dimensional deep cavities with multi relaxation time lattice Boltzmann method on graphic processing units [J] . Hung-Wen Chang, Pei-Yao Hong, Li-Song Lin, Computers & Fluids . 2013,第Null期

机译：在图形处理单元上用多重弛豫时间格子Boltzmann方法模拟三维深腔中的流动不稳定性
4. A Block-Asynchronous Relaxation Method for Graphics Processing Units [C] . Anzt Hartwig, Tomov Stanimire, Dongarra Jack, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops amp; PhD Forum . 2012

机译：图形处理单元的块异步松弛方法
5. Parallel Implementation of Resampling Methods for Particle Filtering on Graphics Processing Units [D] . Nicely, Matthew A. 2019

机译：图形处理单元粒子滤波重采样方法的平行实现
6. Acceleration of Linear Finite-Difference Poisson-Boltzmann Methods on Graphics Processing Units [O] . Ruxi Qi, Wesley M. Botello-Smith, Ray Luo -1

机译：图形处理单元上线性有限差分泊松-玻尔兹曼方法的加速
7. A Block-Asynchronous Relaxation Method for Graphics Processing Units [O] . Hartwig Anzt, Stanimire Tomov, Jack Dongarra, Vincent Heuveline 2012

机译：图形处理单元的块异步松弛方法

A block-asynchronous relaxation method for graphics processing units

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅