Acceleration Techniques for FETI Solvers for GPU Accelerators

机译：用于GPU加速器的FETI溶剂的加速技术

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we evaluate several approaches to performing simultaneous matrix-vector multiplication of large numbers of matrices on a GPU accelerator. The goal of this evaluation is to develop efficient techniques for massively parallel Hybrid Total FETI solvers in our ESPRESO library. FETI solvers generally use sparse matrices. To overcome this we previously proposed the Local Schur Complement method for FETI to convert sparse matrices to their dense representation, without significantly increasing the memory requirements of the GPU accelerator. We selected the following techniques: standard GEMV, CUDA streams, dynamic parallelism, batched GEMM, BSR GEMV and HYB GEMV. Our results show that (i) if a FETI solver contains a large number of small matrices i.e. there is large number of small subdomains, then the best approach is dynamic parallelism; (ii) if there is small number of large subdomains, then the optimal approaches are dynamic parallelism and CUDA streams. Please note that Local Schur Complement method in conjunction with Hybrid Total FETI perform better with smaller subdomains.

机译：在本文中，我们在GPU加速器上评估了在GPU加速器上执行大量矩阵的同步矩阵乘法的方法。该评估的目标是为我们的ESPRESO图书馆中的大规模平行杂交总索赔的技术开发有效的技术。纤维溶剂通常使用稀疏矩阵。为了克服这一点，我们以前提出了本地SCHUR补充方法，用于将稀疏矩阵转换为其密集表示，而不会显着提高GPU加速器的内存要求。我们选择了以下技术：标准Gemv，Cuda流，动态并行，批量宝石，BSR Gemv和Hyb Gemv。我们的结果表明，（i）如果FETI求解器包含大量的小矩阵，则有大量的小亚域，那么最好的方法是动态的并行性; （ii）如果大量的大域数量，则最佳方法是动态的并行性和CUDA流。请注意，本地SCUR补充方法与混合动力总量一起执行更好的子域。

著录项

来源
《International Conference on High Performance Computing and Simulation》|2018年|523-1070p|共8页
会议地点
作者
Radim Vavrik; Lubomir Riha;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP30-53;
关键词
High performance computing; GPU; CUDA; GEMV; Matrix-vector multiplication; Dense linear algebra; LSC;

机译：高性能计算;GPU;CUDA;GEMV;矩阵矢量乘法;致密线性代数;LSC;

相似文献

外文文献
中文文献
专利

1. Correction to: Leveraging HPC accelerator architectures with modern techniques - hydrologic modeling on GPUs with ParFlow [J] . Hokkanen Jaro, Kollet Stefan, Kraus Jiri, Computational Geosciences . 2021,第5期

机译：纠正：利用现代技术利用HPC加速器架构 - 用Parflow在GPU上进行水文建模
2. Leveraging HPC accelerator architectures with modern techniques - hydrologic modeling on GPUs with ParFlow [J] . Hokkanen Jaro, Kollet Stefan, Kraus Jiri, Computational Geosciences . 2021,第5期

机译：利用现代技术利用HPC加速器架构 - 用Parflow进行GPU的水文建模
3. A massively parallel and memory-efficient FEM toolbox with a hybrid total FETI solver with accelerator support [J] . Riha Lubomir, Merta Michal, Vavrik Radim, Experimental Mechanics . 2019,第4期

机译：大型并行和内存高效的FEM工具箱，带有混合式FETI混合求解器，并支持加速器
4. Acceleration Techniques for FETI Solvers for GPU Accelerators [C] . Radim Vavrík, Lubomír Ríha International Conference on High Performance Computing Simulation . 2018

机译：针对GPU加速器的FETI解算器的加速技术
5. Performance analysis and acceleration of nuclear physics application on high-performance computing platforms using GPGPUs and topology-aware mapping techniques [D] . Oryspayev, Dossay. 2016

机译：使用GPGPU和拓扑信息映射技术对高性能计算平台核物理应用的性能分析与加速
6. A survey of GPU-based acceleration techniques in MRI reconstructions [O] . Haifeng Wang, Hanchuan Peng, Yuchou Chang, 2018

机译：MRI重建中基于GPU的加速技术的调查
7. Leveraging HPC accelerator architectures with modern techniques — hydrologic modeling on GPUs with ParFlow [O] . Jaro Hokkanen, Stefan Kollet, Jiri Kraus, 2021

机译：利用现代技术利用HPC加速器架构 - Parflow上GPU的水文建模

Acceleration Techniques for FETI Solvers for GPU Accelerators

摘要

著录项

相似文献

相关主题

期刊订阅