首页> 外文期刊>Parallel Computing >The communication-hiding pipelined BiCGstab method for the parallel solution of large unsymmetric linear systems
【24h】

The communication-hiding pipelined BiCGstab method for the parallel solution of large unsymmetric linear systems

机译:大型非对称线性系统并行求解的隐藏通信的流水线BiCGstab方法

获取原文
获取原文并翻译 | 示例
           

摘要

A High Performance Computing alternative to traditional Krylov subspace methods, pipelined Krylov subspace solvers offer better scalability in the strong scaling limit compared to standard Krylov subspace methods for large and sparse linear systems. The typical synchronization bottleneck is mitigated by overlapping time-consuming global communication phases with local computations in the algorithm. This paper describes a general framework for deriving the pipelined variant of any Krylov subspace algorithm. The proposed framework was implicitly used to derive the pipelined Conjugate Gradient (p-CG) method in Hiding global synchronization latency in the preconditioned Conjugate Gradient algorithm by P. Ghysels and W. Vanroose, Parallel Computing, 40(7):224-238, 2014. The pipelining framework is subsequently illustrated by formulating a pipelined version of the BiCGStab method for the solution of large unsymmetric linear systems on parallel hardware. A residual replacement strategy is proposed to account for the possible loss of attainable accuracy and robustness by the pipelined BiCGStab method. It is shown that the pipelined algorithm improves scalability on distributed memory machines, leading to significant speedups compared to standard preconditioned BiCGStab. (C) 2017 Elsevier B.V. All rights reserved.
机译:流水线式Krylov子空间求解器是传统Krylov子空间方法的一种高性能计算替代方法,与大型和稀疏线性系统的标准Krylov子空间方法相比,在强大的缩放比例限制下可提供更好的可伸缩性。通过将耗时的全局通信阶段与算法中的本地计算重叠,可以缓解典型的同步瓶颈。本文介绍了用于推导任何Krylov子空间算法的流水线变体的通用框架。 P.Ghysels和W.Vanroose并行计算,40(7):224-238,将所提出的框架隐式地用于推导隐藏在预条件共轭梯度算法中的全局同步延迟中的流水线共轭梯度(p-CG)方法, 2014年。流水线框架随后通过制定BiCGStab方法的流水线版本进行说明,以解决并行硬件上的大型非对称线性系统。提出了一种残差替换策略,以解决流水线BiCGStab方法可能获得的精度和鲁棒性损失的问题。结果表明,与标准的预处理BiCGStab相比,流水线算法提高了分布式存储计算机上的可伸缩性,从而显着提高了速度。 (C)2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号