GPGPU-based parallel computing applied in the FEM using the conjugate gradient algorithm: a review

Pikle Nileshchandra K.; Sathe Shailesh R.; Vyavhare Arvind Y.

Sadhana: Academy Proceedings in Engineering Science

Abstract

Parallelization of the finite-element method (FEM) has been studied by the scientific and high-performance computing community for over a decade. Most FEM computations are linear algebra, namely matrix and vector operations. These operations follow the single-instruction multiple-data (SIMD) computation pattern, which suits shared-memory parallel architectures. General-purpose graphics processing units (GPGPUs) have been used effectively to parallelize FEM computations since 2007. The solver step of the FEM is often carried out using conjugate gradient (CG)-type iterative methods because of their faster convergence and greater opportunities for parallelization. Although the SIMD computation patterns in the FEM map naturally onto GPU computing, there are some pitfalls, such as underutilization of threads, uncoalesced memory access, low arithmetic intensity, the limited amount of fast memory on GPUs, and synchronization. Nevertheless, FEM applications have been successfully deployed on GPUs over the last 10 years to achieve significant performance improvements. This paper presents a comprehensive review of the parallel optimization strategies applied in each step of the FEM. The pitfalls and trade-offs linked to each step are also discussed. Furthermore, some extraordinary methods that exploit the tremendous computing power of a GPU are discussed. This review is not limited to a single field of engineering; it applies to all fields of engineering and science in which FEM-based simulations are necessary.
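The solver step described in the abstract can be illustrated with a minimal sketch of unpreconditioned CG driven by a sparse matrix-vector product (SpMV) in compressed sparse row (CSR) format. This is not code from the reviewed paper: the function names, the CSR layout choice, and the 3x3 test matrix (a small 1D stiffness-like matrix) are assumptions made for illustration, and the loops run serially on the CPU. The comments mark where a GPU implementation would typically parallelize.

```python
# Illustrative sketch only: all names and the tiny test system below are
# assumptions, not taken from the reviewed paper.

def spmv_csr(row_ptr, col_idx, values, x):
    """Return y = A x for a matrix A stored in CSR format.

    Each row's dot product is independent of the others, so a GPU kernel
    could assign one thread (or one warp) to each row.
    """
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for row in range(n_rows):
        acc = 0.0
        for k in range(row_ptr[row], row_ptr[row + 1]):
            # Indirect access x[col_idx[k]] is the usual source of
            # uncoalesced memory access on a GPU.
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


def dot(u, v):
    """Inner product; on a GPU this is a reduction that needs synchronization."""
    return sum(a * b for a, b in zip(u, v))


def conjugate_gradient(row_ptr, col_idx, values, b, tol=1e-10, max_iter=1000):
    """Unpreconditioned CG for a symmetric positive-definite CSR matrix.

    Returns the approximate solution and the number of iterations performed.
    """
    x = [0.0] * len(b)
    r = list(b)            # residual r = b - A x with the initial guess x = 0
    p = list(r)            # initial search direction
    rs_old = dot(r, r)
    for iteration in range(max_iter):
        if rs_old ** 0.5 < tol:
            return x, iteration
        Ap = spmv_csr(row_ptr, col_idx, values, p)   # dominant cost per iteration
        alpha = rs_old / dot(p, Ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        rs_new = dot(r, r)
        beta = rs_new / rs_old
        p = [ri + beta * pi for ri, pi in zip(r, p)]
        rs_old = rs_new
    return x, max_iter


# Hypothetical test system: A = [[2,-1,0],[-1,2,-1],[0,-1,2]], b = [1,0,1].
row_ptr = [0, 2, 5, 7]
col_idx = [0, 1, 0, 1, 2, 1, 2]
values = [2.0, -1.0, -1.0, 2.0, -1.0, -1.0, 2.0]
b = [1.0, 0.0, 1.0]

solution, iterations = conjugate_gradient(row_ptr, col_idx, values, b)
print(solution, iterations)
```

Each CG iteration performs one SpMV, two inner products, and three vector updates. Only a few floating-point operations are done per value loaded, which is one way to see the low arithmetic intensity that the abstract lists among the pitfalls.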


Bibliographic details

Source
Sadhana: Academy Proceedings in Engineering Science | 2018, Issue 7 | 21 pages
Authors
Pikle Nileshchandra K.; Sathe Shailesh R.; Vyavhare Arvind Y.
Author affiliations

Visvesvaraya Natl Inst Technol, Dept Comp Sci & Engn, Nagpur, Maharashtra, India;

Visvesvaraya Natl Inst Technol, Dept Comp Sci & Engn, Nagpur, Maharashtra, India;

Visvesvaraya Natl Inst Technol, Dept Appl Mech, Nagpur, Maharashtra, India

Indexing information
Original format: PDF
Language: English
Chinese Library Classification: Industrial technology
Keywords
Finite-element method (FEM); conjugate gradient (CG); sparse matrix-vector multiplication (SpMV); assembly-free FEM (AF-FEM); graphics processing units (GPUs); compute unified device architecture (CUDA); parallel computing;


