Steps towards GPU Accelerated Aggregation AMG

机译：GPU加速聚合AMG的步骤

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present an implementation of AMG with simple aggregation techniques on multiple GPUs. It supports the parallel matrix representations typically used for finite volume discretisation. We employ the ICRS sparse matrix format and the asynchronous exchange mechanism of MPI on CPUs that has been modified to make it suitable for the GPU coprocessors. We show that the solution phase of the standard v-cycle AMG with simple aggregation is accelerated by a factor of up to 12. The solution phase of the more advanced Krylov-accelerated AMG runs faster by a factor of up to 7 on Nvidia TESLA C2070 compared to calculation on Intel X5650 CPUs.

机译：我们在多个GPU上呈现了具有简单聚合技术的AMG的实现。它支持通常用于有限体积离散的并行矩阵表示。我们采用ICRS稀疏矩阵格式和MPI对CPU的异步交换机制已被修改为使其适用于GPU协处理器。我们表明，具有简单聚集的标准V周期AMG的解决方案阶段加速了最多12的因子。更先进的Krylov-Concelerated AMG的溶液阶段在NVIDIA Tesla C2070上的倍数最高7倍。与Intel X5650 CPU的计算相比。

著录项

来源
《International Symposium on Parallel and Distributed Computing》|2012年||共8页
会议地点
作者
Emans Maximilian; Liebmann Manfred; Basara Branislav;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP316.4-53;
关键词

相似文献

外文文献
中文文献
专利

1. AMGX: A LIBRARY FOR GPU ACCELERATED ALGEBRAIC MULTIGRID AND PRECONDITIONED ITERATIVE METHODS [J] . Naumov M., Arsaev M., Castonguay P., SIAM Journal on Scientific Computing . 2015,第5期

机译：AMGX：GPU加速的代数多重网格和预设迭代方法的库
2. Accelerating Industrial Applications: The Development of Basic GPU Kernels for the New Block AMG Algorithms for Solving SLE with Explicitly Calculated Sparse Basis [J] . Ilya Afanasyev, Yury Potapov, Sergey Sobolev, Procedia Computer Science . 2015,第1期

机译：加速工业应用：针对新块AMG算法的基本GPU内核的开发，该算法可通过显式计算的稀疏基础解决SLE
3. A local time stepping algorithm for GPU-accelerated 2D shallow water models [J] . Dazzi Susanna, Vacondio Renato, Dal Palu Alessandro, Advances in Water Resources . 2018,第jana期

机译：GPU加速的2D浅水模型的局部时间步长算法
4. Steps towards GPU Accelerated Aggregation AMG [C] . Emans Maximilian, Liebmann Manfred, Basara Branislav 2012 11th International Symposium on Parallel and Distributed Computing. . 2012

机译：迈向GPU加速聚合AMG的步骤
5. GPUBLQMR: GPU-Accelerated Sparse Block Quasi-Minimum Residual Linear Solver [D] . Lacouture, Rubens. 2021

机译：GPublQMR：GPU加速稀疏块准余量剩余线性求解器
6. Accelerating the Finite-Element Method for Reaction-Diffusion Simulations on GPUs with CUDA [O] . Hedi Sellami, Leo Cazenille, Teruo Fujii, 2020

机译：加速CUDA对GPU反应扩散模拟的有限元法
7. Accelerating Industrial Applications: The Development of Basic GPU Kernels for the New Block AMG Algorithms for Solving SLE with Explicitly Calculated Sparse Basis [O] . Afanasyev Ilya, Potapov Yury, Sobolev Sergey, 2015

机译：加速工业应用：针对新块AMG算法的基本GPU内核的开发，该算法可通过显式计算的稀疏基础解决SLE

Steps towards GPU Accelerated Aggregation AMG

摘要

著录项

相似文献

相关主题

期刊订阅