首页> 外文学位 >New sequential and scalable parallel algorithms for incomplete factor preconditioning.

【24h】

New sequential and scalable parallel algorithms for incomplete factor preconditioning.

机译：用于不完整因子预处理的新的顺序可扩展并行算法。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The solution of large, sparse, linear systems of equations Ax = b is an important kernel, and the dominant term with regard to execution time, in many applications in scientific computing. The large size of the systems of equations being solved currently (millions of unknowns and equations) requires iterative solvers on parallel computers. Preconditioning, which is the process of translating a linear system into a related system that is easier to solve, is widely used to reduce solution time and is sometimes required to ensure convergence. Level-based preconditioning (ILU(ℓ)) has long been used in serial contexts and is widely recognized as robust and effective for a wide range of problems. However, the method has long been regarded as an inherently sequential technique. Parallelism, it has been thought, can be achieved primarily at the expense of increased iterations. We dispute these claims.; The first half of this dissertation takes an in-depth look at structurally based ILU(ℓ) symbolic factorization. There are two definitions of fill level, “sum” and “max,” that have been proposed. Hitherto, these definitions have been cast in terms of matrix terminology. We develop a sequence of lemmas and theorems that provide graph theoretic characterizations of both definitions; these characterizations are based on the static graph of a matrix, G(A).; Our Incomplete Fill Path Theorem characterizes fill levels per the sum definition; this is the definition that is used in most library implementations of the “classic” ILU(ℓ) factorization algorithm. Our theorem leads to several new graph-search algorithms that compute factors identical, or nearly identical, to those computed by the “classic” algorithm. Our analyses shows that the new algorithms have lower run time complexity than that of the previously existing algorithms for certain classes of matrices that are commonly encountered in scientific applications.; The second half of this dissertation presents a Parallel ILU algorithmic framework (PILU). This framework enables scalable parallel ILU preconditioning by combining concepts from domain decomposition and graph ordering. The framework can accommodate ILU(ℓ) factorization as well as threshold-based ILUT methods.; A model implementation of the framework, the Euclid library, was developed as part of this dissertation. This library was used to obtain experimental results for Poisson's equation, the Convection-Diffusion equation, and a nonlinear Radiative Transfer problem. The experiments, which were conducted on a variety of platforms with up to 400 CPUs, demonstrate that our approach is highly scalable for arbitrary ILU(ℓ) fill levels.

机译：在科学计算的许多应用中，方程式 Ax = b 的大型，稀疏线性系统的解决方案是重要的内核，并且是执行时间的主要术语。当前正在解决的大型方程系统（数百万个未知数和方程）需要并行计算机上的迭代求解器。预处理是将线性系统转换为易于解决的相关系统的过程，被广泛用于减少求解时间，有时需要进行预收敛以确保收敛。基于级别的预处理（ILU（＆ell;））长期以来一直在串行环境中使用，并被广泛认为对各种问题都有效而有效。但是，该方法长期以来一直被视为一种固有的顺序技术。人们认为，并行性主要可以通过增加迭代次数来实现。我们对这些主张提出异议。本文的前半部分深入研究了基于结构的ILU（＆ell;）符号分解。已经提出了填充水平的两个定义，即“总和”和“最大”。迄今为止，这些定义是根据矩阵术语进行的。我们开发了引理和定理的序列，提供了这两种定义的图论表征。这些特征基于矩阵 G （ A ）的静态图。我们的不完整填充路径定理根据总和定义来表征填充水平；这是“经典” ILU（＆ell;）因式分解算法的大多数库实现中使用的定义。我们的定理导致了几种新的图搜索算法，它们计算的因子与“经典”算法计算的因子相同或几乎相同。我们的分析表明，对于科学应用中常见的某些类别的矩阵，新算法的运行时复杂度比以前的现有算法低。本文的后半部分介绍了并行ILU算法框架（PILU）。该框架通过结合域分解和图排序的概念，实现了可扩展的并行ILU预处理。该框架可以适应ILU（＆ell;）分解以及基于阈值的ILUT方法。作为本文的一部分，开发了该框架的模型实现，即Euclid库。该库用于获得泊松方程，对流扩散方程和非线性辐射传递问题的实验结果。在具有多达400个CPU的各种平台上进行的实验证明，我们的方法对于任意ILU（＆ell;）填充级别具有高度可扩展性。

著录项

作者
Hysom, David A.;
展开▼
作者单位

Old Dominion University.;

展开▼
授予单位 Old Dominion University.;
学科 Computer Science.
学位 Ph.D.
年度 2001
页码 160 p.
总页数 160
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. A scalable parallel algorithm for incomplete factor preconditioning [J] . Hysom D., Pothen A. SIAM Journal on Scientific Computing . 2001,第6期

机译：不完全因子预处理的可扩展并行算法
2. Sequential and parallel deficit scaling algorithms for minimum flow in bipartite networks [J] . LAURA CIUPALA, ELEONOR CIUREA WSEAS Transactions on Computers . 2008,第10a12期

机译：顺序和并行赤字缩放算法，用于双向网络中的最小流量
3. MODYLAS: A Highly Parallelized General-Purpose Molecular Dynamics Simulation Program for Large-Scale Systems with Long-Range Forces Calculated by Fast Multipole Method (FMM) and Highly Scalable Fine-Grained New Parallel Processing Algorithms [J] . Yoshimichi Andoh, Noriyuki Yoshii, Kazushi Fujimoto Journal of chemical theory and computation: JCTC . 2013,第7期

机译：MODYLAS：具有并行力的大型多用途通用分子动力学仿真程序，该程序由快速多极方法（FMM）和高度可扩展的细粒度新并行处理算法计算而得
4. Impact of two factors on several domain decomposition based parallel incomplete factorizations for the meso-scale simulation of concrete [C] . WU Jian-ping, ZHAO Jun, SONG Jun-qiang, International Conference on Information and Computing . 2010

机译：两个因素对基于多个域分解的基于域分解的混凝土中型尺度模拟的平行不完全因子
5. Parallelization of probabilistic sequential search algorithms. [D] . Jog, Prasanna Dattatreya. 1989

机译：概率顺序搜索算法的并行化。
6. Large-Scale Modeling of Epileptic Seizures: Scaling Properties of Two Parallel Neuronal Network Simulation Algorithms [O] . Lorenzo L. Pesce, Hyong C. Lee, Mark Hereld, 2013

机译：癫痫发作的大规模建模：两种并行神经元网络仿真算法的缩放性质。
7. A Scalable Parallel Algorithm for Incomplete Factor Preconditioning [O] . David Hysom, Alex Pothen 2000

机译：不完全因子预处理的可扩展并行算法
8. Sequential Quadratic Programming Algorithm Using an Incomplete Solution of theSubproblem [R] . Murray, W., Prieto, F. J. 1993

机译：利用子问题不完全解的序列二次规划算法

New sequential and scalable parallel algorithms for incomplete factor preconditioning.

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅