首页> 外文期刊>Journal of supercomputing >Solving finite difference linear systems on GPUs: CUDA based Parallel Explicit Preconditioned Biconjugate Conjugate Gradient type Methods
【24h】

Solving finite difference linear systems on GPUs: CUDA based Parallel Explicit Preconditioned Biconjugate Conjugate Gradient type Methods

机译:在GPU上求解有限差分线性系统:基于CUDA的并行显式预处理双共轭共轭梯度类型方法

获取原文
获取原文并翻译 | 示例
           

摘要

During the last decades, explicit approximate inverse preconditioning methods have been used for efficiently solving sparse linear systems on multiprocessor systems. The effectiveness of explicit approximate inverse preconditioning schemes relies on the use of efficient preconditioners that are close approximants to the coefficient matrix and are fast to compute in parallel. A new parallel computational technique is proposed for the parallelization of the explicit preconditioned conjugate gradient type method on a Graphics Processing Unit (GPU). The proposed parallel methods have been implemented using Compute Unified Device Architecture (CUDA) developed by NVIDIA. The inherently parallel operations between vectors and matrices involved in the explicit preconditioned biconjugate conjugate gradient type schemes exhibit significant amounts of loop-level parallelism because of the matrix-vector and the vector-vector products that can lead to high performance gain on the GPU systems, specifically designed for such computations. Finally, numerical results for the performance of the explicit preconditioned biconjugate conjugate gradient type method for solving characteristic two dimensional boundary value problems, using the finite difference method, on a massive multiprocessor interface on a GPU are presented. The CUDA implementation issues of the proposed method are also discussed.
机译:在最近的几十年中,显式近似逆预处理方法已用于有效地解决多处理器系统上的稀疏线性系统。显式近似逆预处理方案的有效性取决于有效预处理器的使用,该预处理器与系数矩阵非常接近,并且可以快速并行计算。提出了一种新的并行计算技术,用于在图形处理单元(GPU)上对显式预处理共轭梯度类型方法进行并行化。所建议的并行方法已使用NVIDIA开发的Compute Unified Device Architecture(CUDA)实施。由于矩阵向量和向量向量乘积可导致GPU系统获得高性能,因此显式的预条件双共轭共轭梯度类型方案中涉及的向量和矩阵之间固有的并行操作表现出大量的循环级并行性。专为此类计算而设计。最后,给出了在GPU上的大规模多处理器接口上使用有限差分法解决特征二维边界值问题的显式预处理双共轭共轭梯度型方法的性能的数值结果。还讨论了该方法的CUDA实现问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号