...
首页> 外文期刊>Parallel Computing >Structure-adaptive parallel solution of sparse triangular linear systems
【24h】

Structure-adaptive parallel solution of sparse triangular linear systems

机译:稀疏三角线性系统的结构自适应并行解

获取原文
获取原文并翻译 | 示例
           

摘要

Solving sparse triangular systems of linear equations is a performance bottleneck in many methods for solving more general sparse systems. Both for direct methods and for many iterative preconditioners, it is used to solve the system or improve an approximate solution, often across many iterations. Solving triangular systems is notoriously resistant to parallelism, however, and existing parallel linear algebra packages appear to be ineffective in exploiting significant parallelism for this problem. We develop a novel parallel algorithm based on various heuristics that adapt to the structure of the matrix and extract parallelism that is unexploited by conventional methods. By analyzing and reordering operations, our algorithm can often extract parallelism even for cases where most of the nonzero matrix entries are near the diagonal. Our main parallelism strategies are: (1) identify independent rows, (2) send data earlier to achieve greater overlap, and (3) process dense off-diagonal regions in parallel. We describe the implementation of our algorithm in Charm++ and MPI and present promising experimental results on up to 512 cores of BlueGene/P, using numerous sparse matrices from real applications.
机译:解决线性方程组的稀疏三角系统是解决更通用的稀疏系统的许多方法中的性能瓶颈。对于直接方法和许多迭代式预处理器,它通常用于许多迭代中的系统求解或改进近似解决方案。众所周知,求解三角形系统具有抗并行性的功能,并且现有的并行线性代数包对于利用此问题的有效并行性似乎无效。我们基于各种启发式算法开发了一种新颖的并行算法,该算法适用于矩阵的结构并提取传统方法无法利用的并行性。通过分析和重新排序操作,即使对于大多数非零矩阵项都位于对角线附近的情况,我们的算法也通常可以提取并行度。我们的主要并行策略是:(1)识别独立的行,(2)尽早发送数据以实现更大的重叠,以及(3)并行处理密集的非对角线区域。我们描述了我们的算法在Charm ++和MPI中的实现,并使用来自实际应用程序的众多稀疏矩阵在BlueGene / P的多达512个内核上展示了很有希望的实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号