Parallel threshold-based ILU factorization

机译：基于并行阈值的ILU分解

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The sparse linear systems arising in finite element applications are commonly solved using iterative methods. In particular, as the size of these problems increases, the increased computational and memory requirements of these problems render in-core direct solution methods unusable, leaving iterative methods as the only viable alternative for solving these problems in core.The major computational kernels of an iterative method are (i) computation of preconditioner, (ii) multiplication of a sparse matrix with a vector, and (iii) application of the preconditioner. Threshold-based incomplete LU factorization have been found to be quite effective in preconditioning iterative system solvers [14]. However, because these factorizations allow the fill elements to be created dynamically, their parallel formulations had not been well understood, and they have been considered to be unsuitable for distributed-memory parallel computers [13]. Furthermore, solution of the resulting sparse triangular system (which is required for the application of the preconditioner) is generally more difficult to parallelize than the multiplication of a sparse matrix with a vector.In this paper we show that highly parallel graph partitioning algorithms in conjunction with parallel algorithms for computing maximal independent sets can be used to develop scalable parallel formulations of incomplete factorization algorithms. We present a highly parallel formulation of the ILUT factorization algorithm [14] for distributed memory parallel computers. This algorithm uses our parallel multilevel k-way graph partitioning algorithm [6,8] in conjunction with a parallel maximal independent subset algorithm to parallelize both the factorization as well as the solution of the resulting triangular factors. We also present a modified ILUT factorization algorithm (ILUT*) that requires less time and is more scalable than ILUT. Our experiments on Cray T3D show that our parallel ILUT* algorithm achieve a high degree of concurrency, and when used as a preconditioner, it is comparable in quality to the unmodified ILUT algorithm. Furthermore, our experiments using the GMRES iterative solver show that the amount of time spent in computing the factorization using the ILUT* algorithm is usually much less than the amount of time required to solve the systems.

机译：通常使用迭代方法解决有限元应用中出现的稀疏线性系统。特别是，随着这些问题规模的增大，这些问题的不断增长的计算和内存需求使得内核内直接解决方法无法使用，从而使迭代方法成为解决内核中这些问题的唯一可行选择。迭代方法是（i）预处理器的计算，（ii）稀疏矩阵与矢量的相乘，以及（iii）预处理器的应用。已经发现基于阈值的不完全LU分解在预处理迭代系统求解器中非常有效[14]。但是，由于这些分解可以动态创建填充元素，因此对其并行表示还没有很好的理解，因此认为它们不适用于分布式内存并行计算机[13]。此外，与将稀疏矩阵与向量相乘相比，所得稀疏三角系统（应用预处理器所需的）的解决方案通常更难并行化。带有用于计算最大独立集的并行算法的算法可用于开发不完整分解算法的可扩展并行公式。我们为分布式内存并行计算机提出了ILUT因数分解算法[14]的高度并行表示。该算法使用我们的并行多级 k 方向图分区算法[6,8]以及并行的最大独立子集算法来并行化分解和生成的三角因子的求解。我们还提出了一种经过改进的ILUT分解算法（ILUT *），该算法比ILUT所需的时间更少且可扩展性更高。我们在Cray T3D上进行的实验表明，我们的并行ILUT *算法实现了高度的并发性，并且在用作预处理器时，其质量与未经修改的ILUT算法相当。此外，我们使用GMRES迭代求解器进行的实验表明，使用ILUT *算法计算因式分解所花费的时间通常比解决系统所需的时间少得多。 展开▼

著录项

来源
《ACM/IEEE conference on Supercomputing》|1997年|P.1-24|共24页

会议地点

作者
George Karypis; Vipin Kumar;
展开▼

作者单位

展开▼

会议组织

原文格式 PDF

正文语种

中图分类计算技术、计算机技术;

关键词

引文网络

参考文献

引证文献

共引文献

同被引文献

二级参考文献

二级引证文献

相似文献

外文文献

中文文献

专利

1. PARILUT-A NEW PARALLEL THRESHOLD ILU FACTORIZATION [J] . Anzt Hartwig, Chow Edmond, Dongarra Jack SIAM Journal on Scientific Computing . 2018,第4期

机译：parilut-newpartal阈值ILU分解

2. Parallel Newton two-stage methods based on ILU factorizations for nonlinear systems [J] . Arnal J, Migallon H, Migallon V, Numerical linear algebra with applications . 2006,第7期

机译：基于ILU分解的非线性系统并行牛顿两阶段方法

3. A parallel multistage ILU factorization based on a hierarchical graph decomposition [J] . Henon P, Saad Y SIAM Journal on Scientific Computing . 2006,第6期

机译：基于层次图分解的并行多级ILU分解

4. Parallel Threshold-based ILU Factorization [C] . Karypis G., Kumar V. Supercomputing, ACM/IEEE 1997 Conference . -1

机译：基于并行阈值的ILU分解

5. Parallel multilevel block ILU preconditioning techniques for solving general sparse linear systems. [D] . Shen, Chi. 2004

机译：解决通用稀疏线性系统的并行多级块ILU预处理技术。

6. A Comparative Study of the Application of Fluorescence Excitation-Emission Matrices Combined with Parallel Factor Analysis and Nonnegative Matrix Factorization in the Analysis of Zn Complexation by Humic Acids [O] . Patrycja Boguta, Piotr M. Pieczywek, Zofia Sokołowska 2016

机译：荧光激发-发射矩阵与平行因子分析和非负矩阵分解相结合在腐植酸锌络合分析中的比较研究

7. Parallel Threshold-based ILU Factorization * [O] . 2008

机译：基于并行阈值的ILU分解*

8. Parallel ILU ordering and convergence relationships: Numerical experiments [R] . Hysom, David, Pothen, Alex 2000

机译：并行ILU排序和收敛关系：数值实验

1. 排序对重叠区域分解型并行ILU的影响分析 [J] . 吴建平 ,张理论 ,马怀发 . 计算机工程与应用 . 2012,第033期

2. ILU分解的两步多重分裂迭代法的收敛性研究 [J] . 江山 . 阜阳师范学院学报（自然科学版） . 2012,第002期

3. 基于混合阈值的清除重复间隔阈值经验模态分解去噪方法 [J] . 王平根 ,吕敬祥 . 井冈山大学学报（自然科学版） . 2019,第006期

4. 基于混合阈值的清除重复间隔阈值经验模态分解去噪方法 [J] . 王平根 ,吕敬祥 . 井冈山大学学报 . 2019,第006期

5. 基于改进阈值的小波分解和经验模态分解的人体脉搏信号滤波算法研究 [J] . 麻芙阳 ,谢锐 . 电子产品世界 . 2014,第002期

6. 基于经验模态分解联合小波阈值的自适应心电信号基线漂移噪声去除 [C] . 朝乐蒙 ,梁莹 ,夏慧琳 . 中国医学装备大会暨2021医学装备展览会 . 2018

7. 多水平ILU分解及在电磁计算中的应用研究 [A] . 郑振裥 . 2012

1. 用于分布式稀疏线性系统中改进的并行ILU分解的系统和方法 [P] . 中国专利： CN105320566A . 2016-02-10

2. 用于分布式稀疏线性系统中改进的并行ILU分解的系统和方法 [P] . 中国专利： CN102334110B . 2014.09.24

3. SYSTEMS AND METHODS FOR IMPROVED PARALLEL ILU FACTORIZATION IN DISTRIBUTED SPARSE LINEAR SYSTEMS [P] . 外国专利： EP2353101B1 . 2017-01-04

机译：分布式稀疏线性系统中改善并行ILU分解的系统和方法

4. Systems and methods for improved parallel ILU factorization in distributed sparse linear systems [P] . 外国专利： US8813053B2 . 2014-08-19

机译：分布式稀疏线性系统中改进并行ILU分解的系统和方法

5. SYSTEMS AND METHODS FOR IMPROVED PARALLEL ILU FACTORIZATION IN DISTRIBUTED SPARSE LINEAR SYSTEMS [P] . 外国专利： EP2353101A4 . 2013-06-05

机译：分布式稀疏线性系统中改善并行ILU分解的系统和方法

相关主题

Parallel threshold-based ILU factorization

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅