A Configurable Architecture for Sparse LU Decomposition on Matrices with Arbitrary Patterns

Xinying Wang; Phillip H. Jones; Joseph Zambreno

首页> 外文期刊>Computer architecture news >A Configurable Architecture for Sparse LU Decomposition on Matrices with Arbitrary Patterns

【24h】

A Configurable Architecture for Sparse LU Decomposition on Matrices with Arbitrary Patterns

机译：具有任意模式的矩阵的稀疏LU分解的可配置体系结构

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Sparse LU decomposition has been widely used to solve sparse linear systems of equations found in many scientific and engineering applications, such as circuit simulation, power system modeling and computer vision. However, it is considered a computationally expensive factorization tool. While parallel implementations have been explored to accelerate sparse LU decomposition, irregular sparsity patterns often limit their performance gains. Prior FPGA-based accelerators have been customized to domain-specific sparsity patterns of pre-ordered symmetric matrices. In this paper, we present an efficient architecture for sparse LU decomposition that supports both symmetric and asymmetric sparse matrices with arbitrary sparsity patterns. The control structure of our architecture parallelizes computation and pivoting operations. Also, on-chip resource utilization is configured based on properties of the matrices being processed. Our experimental results show a 1.6 to 14x speedup over an optimized software implementation for benchmarks containing a wide range of sparsity patterns.

机译：稀疏LU分解已广泛用于求解稀疏线性方程组，这些方程组在许多科学和工程应用中都可以找到，例如电路仿真，电力系统建模和计算机视觉。但是，它被认为是计算上昂贵的因式分解工具。虽然已经探索了并行实现以加速稀疏LU分解，但不规则的稀疏模式通常会限制其性能提升。现有的基于FPGA的加速器已针对预定义的对称矩阵的特定于域的稀疏模式进行了定制。在本文中，我们提出了一种有效的稀疏LU分解体系结构，该体系结构支持具有任意稀疏模式的对称和非对称稀疏矩阵。我们架构的控制结构使计算和数据透视操作并行化。而且，基于正在处理的矩阵的属性来配置片上资源利用。我们的实验结果表明，针对包含多种稀疏模式的基准，优化软件实现了1.6至14倍的加速。

著录项

来源
《Computer architecture news》 |2015年第4期|76-81|共6页
作者
Xinying Wang; Phillip H. Jones; Joseph Zambreno;
展开▼
作者单位

Department of Electrical and Computer Engineering Iowa State University, Ames, Iowa, USA;

Department of Electrical and Computer Engineering Iowa State University, Ames, Iowa, USA;

Department of Electrical and Computer Engineering Iowa State University, Ames, Iowa, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Architecture; FPGA; Sparse LU decomposition; Crout method;

机译：建筑;FPGA;稀疏LU分解;克鲁特法;

相似文献

外文文献
中文文献
专利

1. Parallel LU factorization of sparse matrices on FPGA-based configurable computing engines [J] . Xiaofang Wang, Sotirios G. Ziavras Concurrency and Computation . 2004,第4期

机译：基于FPGA的可配置计算引擎上稀疏矩阵的并行LU分解
2. Randomized LU decomposition using sparse projections [J] . Aizenbud Yariv, Shabat Gil, Averbuch Amir Computers & mathematics with applications . 2016,第9期

机译：使用稀疏投影的随机LU分解
3. Data-sparse LU-decomposition preconditioning combined with multilevel fast multipole method for electromagnetic scattering problems [J] . Wan T., Chen R.S., Hu X.Q. Microwaves, Antennas & Propagation, IET . 2011,第11期

机译：数据稀疏LU分解预处理与多级快速多极子方法相结合解决电磁散射问题
4. PIVOTING STRATEGY FOR FAST LU DECOMPOSITION OF SPARSE BLOCK MATRICES [C] . Lukas Polok, Pavel Smrz Simulation Multi-Conference . 2017

机译：快速LU分解稀疏块矩阵的枢转策略
5. An Analysis of Pivot Strategies to Maintain Sparsity in the LU Decomposition of IPDG Method Applied to the Helmholtz Equation [D] . Severance, Ryan Samuel. 2019

机译：应用于Helmholtz方程的IPDG方法的LU分解中保持稀疏性的枢轴策略分析
6. Auto-Calibrated Parallel Imaging Reconstruction for Arbitrary Trajectories Using k-Space Sparse Matrices (kSPA) [O] . Chunlei Liu, Jian Zhang, Michael E. Moseley -1

机译：自动校准并行成像重建为任意轨迹使用的k空间稀疏矩阵（kspa）
7. Parallel LU Factorization of Sparse Matrices on FPGA-Based Configurable Computing Engines [O] . Xiaofang Wang, Sotirios G. Ziavras 2003

机译：基于FpGa的可配置计算引擎稀疏矩阵的并行LU分解
8. LU Factorization of Sequences of Identically Structured Sparse Matrices Within a Distributed Memory Environment. [R] . Hadfield, S. M. 1994

机译：分布式存储环境中相同结构稀疏矩阵序列的LU分解。

A Configurable Architecture for Sparse LU Decomposition on Matrices with Arbitrary Patterns

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅