Journal: SIAM Journal on Scientific Computing

Weighted matrix ordering and parallel banded preconditioners for iterative linear system solvers


Abstract

The emergence of multicore architectures and highly scalable platforms motivates the development of novel algorithms and techniques that emphasize concurrency and are tolerant of deep memory hierarchies, as opposed to minimizing raw FLOP counts. While direct solvers are reliable, they are often slow and memory-intensive for large problems. Iterative solvers, on the other hand, are more efficient but, in the absence of robust preconditioners, lack reliability. While preconditioners based on incomplete factorizations (whenever they exist) are effective for many problems, their parallel scalability is generally limited. In this paper, we advocate the use of banded preconditioners instead and introduce a reordering strategy that enables their extraction. In contrast to traditional bandwidth reduction techniques, our reordering strategy takes into account the magnitude of the matrix entries, bringing the heaviest elements closer to the diagonal, thus enabling the use of banded preconditioners. When used with effective banded solvers (in our case, the Spike solver), we show that banded preconditioners (i) are more robust than the broad class of incomplete factorization-based preconditioners, (ii) deliver higher processor performance, resulting in shorter time to solution, and (iii) scale to larger parallel configurations. We demonstrate these results experimentally on a large class of problems selected from diverse application domains.
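
The abstract does not say how the reordering is computed. As a rough illustration of the idea of pulling the heaviest entries toward the diagonal, the sketch below assumes a weighted spectral ordering: rows and columns are sorted by the Fiedler vector of a graph Laplacian whose edge weights are the entry magnitudes. This is an assumption made for the example, not a description of the paper's algorithm, and the function names (`weighted_spectral_order`, `band_part`, `band_weight_fraction`) are hypothetical. It uses dense NumPy arrays for brevity and does not involve the Spike solver.

```python
import numpy as np


def weighted_spectral_order(A):
    """Permutation that sorts indices by the Fiedler vector of a weighted Laplacian.

    Edge weights are |a_ij| + |a_ji|, so strongly coupled unknowns tend to
    receive nearby positions. (Illustrative choice, assumed for this sketch.)
    """
    W = np.abs(A) + np.abs(A).T          # symmetric, non-negative weights
    np.fill_diagonal(W, 0.0)             # the diagonal plays no role in the graph
    L = np.diag(W.sum(axis=1)) - W       # weighted graph Laplacian
    _, vecs = np.linalg.eigh(L)          # eigenvalues in ascending order
    fiedler = vecs[:, 1]                 # eigenvector of the second-smallest eigenvalue
    return np.argsort(fiedler)


def band_part(A, k):
    """Copy of A with every entry outside the band |i - j| <= k set to zero."""
    i, j = np.indices(A.shape)
    return np.where(np.abs(i - j) <= k, A, 0.0)


def band_weight_fraction(A, k):
    """Fraction of the total entry magnitude that lies inside the band."""
    return np.abs(band_part(A, k)).sum() / np.abs(A).sum()


# Toy problem: a tridiagonal matrix whose rows and columns have been scrambled.
n = 8
T = 4.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
scramble = np.array([3, 7, 0, 5, 1, 6, 2, 4])
A = T[np.ix_(scramble, scramble)]

p = weighted_spectral_order(A)
A_reordered = A[np.ix_(p, p)]

before = band_weight_fraction(A, 1)
after = band_weight_fraction(A_reordered, 1)
print(f"weight inside band (k=1): before={before:.3f}, after={after:.3f}")

# The band of the reordered matrix is what a banded preconditioner would use.
M = band_part(A_reordered, 1)
```

On this toy matrix the ordering recovers the hidden tridiagonal structure, so the band `M` holds all of the reordered matrix. For real problems the band would capture only part of the weight, and `M` would serve as an approximation of the matrix to be factored by a banded solver inside a Krylov iteration.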
