...
首页> 外文期刊>Journal of Parallel and Distributed Computing >Toward an automatic parallelization of sparse matrix computations
【24h】

Toward an automatic parallelization of sparse matrix computations

机译:走向稀疏矩阵计算的自动并行化

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we propose a generic method of automatic parallelization for sparse matrix computation. This method is based on both a refinement of the data-dependence test proposed by Bernstein and an inspector-executor scheme which is specialized to each input program of the compiler. This analysis mixes compilation process and run-time process.The sparsity of underlying data-structure determines a specific parallelism which increases the degree of parallelism of an algorithm. Such a source of parallelism had already been applied to many numerical algorithms such as the usual Cholesky factorization or LU-decomposition algorithms considered as the gold standards of parallelization based on sparsity. The standard automatic parallelization method cannot tackle such source of parallelism because it is based on the value of cells arrays and not merely on the memory addressing function.Addressing the automatization of this parallelism requires to develop a mixed compile-time and runtime approach integrated in a inspector-executor process. The compilation step provides a dedicated inspector devoted to the analyzed program. The inspector computes the dependence graph at runtime which allows a dynamic parallelization of the execution.As expressed just before, the generic scheme developed in this paper follows the design principles which have been applied, but at each time in an ad hoc way, to many sparse parallelization of numerical algorithms such as Cholesky algorithm. As far as we know, no general formal framework has been proposed to automate such a method of sparse parallelization. In this paper, we propose a generic framework of sparse parallelization (i.e. numerical program independent) which can be applied to any numerical programs satisfying the usual syntactic constraints of parallelization. (c) 2004 Elsevier Inc. All rights reserved.
机译:在本文中,我们提出了一种用于稀疏矩阵计算的自动并行化方法。该方法既基于Bernstein提出的数据依赖测试的改进,又基于专门针对编译器每个输入程序的检查执行器方案。该分析将编译过程和运行时过程混合在一起。基础数据结构的稀疏性决定了特定的并行性,从而增加了算法的并行度。这种并行性的来源已经被应用于许多数值算法,例如通常的Cholesky分解或LU分解算法,这些算法被认为是基于稀疏性的并行化的黄金标准。标准的自动并行化方法不能解决这种并行性问题,因为它基于单元数组的值,而不仅仅是基于内存寻址功能。要解决这种并行性的自动化问题,需要开发一种集成在编译器中的混合编译时和运行时方法。检查员-执行者过程。编译步骤提供了专用于检查程序的检查员。检查器在运行时计算依赖关系图,该依赖关系图允许执行的动态并行化。如前所述,本文开发的通用方案遵循已应用的设计原理,但是每次都以临时方式应用到许多数值算法(例如Cholesky算法)的稀疏并行化。据我们所知,尚未提出任何通用的形式化框架来自动化这种稀疏并行化方法。在本文中,我们提出了一个稀疏并行化的通用框架(即独立于数字程序),该框架可以应用于满足并行化通常语法约束的任何数字程序。 (c)2004 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号