GPU-accelerated scalable solver for banded linear systems

机译：用于带状线性系统的GPU加速可扩展求解器

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Solving a banded linear system efficiently is important to many scientific and engineering applications. Current solvers achieve good scalability only on the linear systems that can be partitioned into independent subsystems. In this paper, we present a GPU based, scalable Bi-Conjugate Gradient Stabilized solver that can be used to solve a wide range of banded linear systems. We utilize a row-oriented matrix decomposition method to divide the banded linear system into several correlated sub-linear systems and solve them on multiple GPUs collaboratively. We design a number of GPU and MPI optimizations to speedup inter-GPU and inter-machine communications. We evaluate the solver on Poisson equation and advection diffusion equation as well as several other banded linear systems. The solver achieves a speedup of more than 21 times running from 6 to 192 GPUs on the XSEDE's Keeneland supercomputer and because of small communication overhead, can scale upto 32 GPUs on Amazon EC2 with relatively slow ethernet network.

机译：有效地解决带状线性系统对于许多科学和工程应用很重要。当前的求解器仅在可以划分为独立子系统的线性系统上才能实现良好的可伸缩性。在本文中，我们提出了一种基于GPU的可扩展双共轭梯度稳定求解器，该求解器可用于求解各种带状线性系统。我们利用面向行的矩阵分解方法将带状线性系统划分为几个相关的子线性系统，并在多个GPU上协同解决。我们设计了许多GPU和MPI优化，以加速GPU间和机器间的通信。我们评估泊松方程和对流扩散方程以及其他几个带状线性系统的求解器。该解决方案在XSEDE的Keeneland超级计算机上从6到192个GPU运行，可实现21倍以上的加速，并且由于通信开销较小，因此在具有相对较慢的以太网网络的Amazon EC2上可以扩展到32个GPU。

著录项

来源
《IEEE International Conference on Cluster Computing》|2013年|1-8|共8页
会议地点
作者
Liu Hang; Seo Jung-Hee; Mittal Rajat; Huang H.Howie;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. A Recurrent Neural-network-based Ultra-Fast, Robust and Scalable Solver of Linear Equation Systems with Potential Integration in a Speeding-up of FEM Solver Architectures [J] . Vahid Tavakkoli, Jean Chamberlain Chedjou, Kyandoghere Kyamakya Fortschritt-Berichte VDI . 2015,第842期

机译：基于递归神经网络的线性方程组超快速，鲁棒和可扩展求解器，可加速FEM求解器体系结构的集成
2. Highly scalable implementation of an implicit matrix-free solver for gas dynamics on GPU-accelerated clusters [J] . Menshov Igor, Pavlukhin Pavel Journal of supercomputing . 2017,第2期

机译：GPU加速集群上气体动力学的隐式无矩阵求解器的高度可扩展实现
3. Linear and nonlinear solvers for simulating multiphase flow within large-scale engineered subsurface systems [J] . Park Heeho D., Hammond Glenn E., Valocchi Albert J., Advances in Water Resources . 2021,第Octa期

机译：用于模拟大型工程地下系统内的多相流动的线性和非线性溶剂
4. GPU-accelerated scalable solver for banded linear systems [C] . Liu Hang, Seo Jung-Hee, Mittal Rajat, IEEE International Conference on Cluster Computing . 2013

机译：带状线性系统的GPU加速可伸缩求解器
5. GPUBLQMR: GPU-Accelerated Sparse Block Quasi-Minimum Residual Linear Solver [D] . Lacouture, Rubens. 2021

机译：GPublQMR：GPU加速稀疏块准余量剩余线性求解器
6. An analytical coupled technique for solving nonlinear large-amplitude oscillation of a conservative system with inertia and static non-linearity [O] . Md. Abdur Razzak, Md. Shamsul Alam -1

机译：具有惯性和静态非线性的保守系统非线性大振幅振荡的解析耦合技术
7. GPU-Accelerated Preconditioned Iterative Linear Solvers ∗ [O] . Ruipeng Li, Yousef Saad 2010

机译：GPU加速的预处理迭代线性求解器∗
8. Numerical solution of nonlinear algebraic equations in stiff ODE solving (1986--89)---Quasi-Newton updating for large scale nonlinear systems (1989--90). Final report, 1986--1990 [R] . Walker, H. F. 1990

机译：非线性代数方程在刚性ODE求解中的数值解（1986--89）---大规模非线性系统的拟牛顿更新（1989--90）。最终报告，1986- 1990

GPU-accelerated scalable solver for banded linear systems

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅