首页> 外文学位 >New sequential and scalable parallel algorithms for incomplete factor preconditioning.
【24h】

New sequential and scalable parallel algorithms for incomplete factor preconditioning.

机译:用于不完整因子预处理的新的顺序可扩展并行算法。

获取原文
获取原文并翻译 | 示例

摘要

The solution of large, sparse, linear systems of equations Ax = b is an important kernel, and the dominant term with regard to execution time, in many applications in scientific computing. The large size of the systems of equations being solved currently (millions of unknowns and equations) requires iterative solvers on parallel computers. Preconditioning, which is the process of translating a linear system into a related system that is easier to solve, is widely used to reduce solution time and is sometimes required to ensure convergence. Level-based preconditioning (ILU(ℓ)) has long been used in serial contexts and is widely recognized as robust and effective for a wide range of problems. However, the method has long been regarded as an inherently sequential technique. Parallelism, it has been thought, can be achieved primarily at the expense of increased iterations. We dispute these claims.; The first half of this dissertation takes an in-depth look at structurally based ILU(ℓ) symbolic factorization. There are two definitions of fill level, “sum” and “max,” that have been proposed. Hitherto, these definitions have been cast in terms of matrix terminology. We develop a sequence of lemmas and theorems that provide graph theoretic characterizations of both definitions; these characterizations are based on the static graph of a matrix, G(A).; Our Incomplete Fill Path Theorem characterizes fill levels per the sum definition; this is the definition that is used in most library implementations of the “classic” ILU(ℓ) factorization algorithm. Our theorem leads to several new graph-search algorithms that compute factors identical, or nearly identical, to those computed by the “classic” algorithm. Our analyses shows that the new algorithms have lower run time complexity than that of the previously existing algorithms for certain classes of matrices that are commonly encountered in scientific applications.; The second half of this dissertation presents a Parallel ILU algorithmic framework (PILU). This framework enables scalable parallel ILU preconditioning by combining concepts from domain decomposition and graph ordering. The framework can accommodate ILU(ℓ) factorization as well as threshold-based ILUT methods.; A model implementation of the framework, the Euclid library, was developed as part of this dissertation. This library was used to obtain experimental results for Poisson's equation, the Convection-Diffusion equation, and a nonlinear Radiative Transfer problem. The experiments, which were conducted on a variety of platforms with up to 400 CPUs, demonstrate that our approach is highly scalable for arbitrary ILU(ℓ) fill levels.
机译:在科学计算的许多应用中,方程式 Ax = b 的大型,稀疏线性系统的解决方案是重要的内核,并且是执行时间的主要术语。当前正在解决的大型方程系统(数百万个未知数和方程)需要并行计算机上的迭代求解器。预处理是将线性系统转换为易于解决的相关系统的过程,被广泛用于减少求解时间,有时需要进行预收敛以确保收敛。基于级别的预处理(ILU(ℓ))长期以来一直在串行环境中使用,并被广泛认为对各种问题都有效而有效。但是,该方法长期以来一直被视为一种固有的顺序技术。人们认为,并行性主要可以通过增加迭代次数来实现。我们对这些主张提出异议。本文的前半部分深入研究了基于结构的ILU(ℓ)符号分解。已经提出了填充水平的两个定义,即“总和”和“最大”。迄今为止,这些定义是根据矩阵术语进行的。我们开发了引理和定理的序列,提供了这两种定义的图论表征。这些特征基于矩阵 G A )的静态图。我们的不完整填充路径定理根据总和定义来表征填充水平;这是“经典” ILU(ℓ)因式分解算法的大多数库实现中使用的定义。我们的定理导致了几种新的图搜索算法,它们计算的因子与“经典”算法计算的因子相同或几乎相同。我们的分析表明,对于科学应用中常见的某些类别的矩阵,新算法的运行时复杂度比以前的现有算法低。本文的后半部分介绍了并行ILU算法框架(PILU)。该框架通过结合域分解和图排序的概念,实现了可扩展的并行ILU预处理。该框架可以适应ILU(ℓ)分解以及基于阈值的ILUT方法。作为本文的一部分,开发了该框架的模型实现,即Euclid库。该库用于获得泊松方程,对流扩散方程和非线性辐射传递问题的实验结果。在具有多达400个CPU的各种平台上进行的实验证明,我们的方法对于任意ILU(ℓ)填充级别具有高度可扩展性。

著录项

  • 作者

    Hysom, David A.;

  • 作者单位

    Old Dominion University.;

  • 授予单位 Old Dominion University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2001
  • 页码 160 p.
  • 总页数 160
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号