首页> 外文会议>IEEE International Conference on Cluster Computing >GPU-accelerated scalable solver for banded linear systems
【24h】

GPU-accelerated scalable solver for banded linear systems

机译:用于带状线性系统的GPU加速可扩展求解器

获取原文
获取外文期刊封面目录资料

摘要

Solving a banded linear system efficiently is important to many scientific and engineering applications. Current solvers achieve good scalability only on the linear systems that can be partitioned into independent subsystems. In this paper, we present a GPU based, scalable Bi-Conjugate Gradient Stabilized solver that can be used to solve a wide range of banded linear systems. We utilize a row-oriented matrix decomposition method to divide the banded linear system into several correlated sub-linear systems and solve them on multiple GPUs collaboratively. We design a number of GPU and MPI optimizations to speedup inter-GPU and inter-machine communications. We evaluate the solver on Poisson equation and advection diffusion equation as well as several other banded linear systems. The solver achieves a speedup of more than 21 times running from 6 to 192 GPUs on the XSEDE's Keeneland supercomputer and because of small communication overhead, can scale upto 32 GPUs on Amazon EC2 with relatively slow ethernet network.
机译:有效地解决带状线性系统对于许多科学和工程应用很重要。当前的求解器仅在可以划分为独立子系统的线性系统上才能实现良好的可伸缩性。在本文中,我们提出了一种基于GPU的可扩展双共轭梯度稳定求解器,该求解器可用于求解各种带状线性系统。我们利用面向行的矩阵分解方法将带状线性系统划分为几个相关的子线性系统,并在多个GPU上协同解决。我们设计了许多GPU和MPI优化,以加速GPU间和机器间的通信。我们评估泊松方程和对流扩散方程以及其他几个带状线性系统的求解器。该解决方案在XSEDE的Keeneland超级计算机上从6到192个GPU运行,可实现21倍以上的加速,并且由于通信开销较小,因此在具有相对较慢的以太网网络的Amazon EC2上可以扩展到32个GPU。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号