A CPU-GPU hybrid approach for the unsymmetric multifrontal method

Chenhan D. Yu; Weichung Wang; Danl Pierce

Abstract

The multifrontal method is an efficient direct method for solving large-scale sparse and unsymmetric linear systems. The method transforms a large sparse matrix factorization process into a sequence of factorizations involving smaller dense frontal matrices. Some of these dense operations can be accelerated by using a graphics processing unit (GPU). We analyze the unsymmetric multifrontal method from both algorithmic and implementation perspectives to see how a GPU, in particular the NVIDIA Tesla C2070, can be used to accelerate the computations. Our main acceleration strategies include (i) performing BLAS on both the CPU and the GPU, (ii) improving the communication efficiency between the CPU and the GPU by using page-locked memory, zero-copy memory, and asynchronous memory copy, and (iii) a modified algorithm that reuses memory between different GPU tasks and sets thresholds to determine whether certain tasks should be performed on the GPU. The proposed acceleration strategies are implemented by modifying UMFPACK, which is an unsymmetric multifrontal linear system solver. Numerical results show that the CPU-GPU hybrid approach can accelerate the unsymmetric multifrontal solver, especially for computationally expensive problems.
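
Strategy (iii) in the abstract, using a threshold to decide whether a task goes to the GPU, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper or from UMFPACK: the names `FrontalUpdate`, `gemm_flops`, and `choose_device`, the choice of flop count as the criterion, and the value of `min_gpu_flops`.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FrontalUpdate:
    """Dimensions of one dense update C -= L * U inside a frontal matrix."""
    m: int  # rows of the block being updated
    n: int  # columns of the block being updated
    k: int  # inner dimension (pivots eliminated in this step)


def gemm_flops(task: FrontalUpdate) -> int:
    """Floating-point operations of an (m x k) times (k x n) product."""
    return 2 * task.m * task.n * task.k


def choose_device(task: FrontalUpdate, min_gpu_flops: int = 50_000_000) -> str:
    """Return "gpu" for large updates and "cpu" for small ones.

    min_gpu_flops is a hypothetical tuning constant: below it, the cost of
    moving data between host and device is assumed to outweigh the speedup.
    """
    return "gpu" if gemm_flops(task) >= min_gpu_flops else "cpu"


small = FrontalUpdate(m=64, n=64, k=8)
large = FrontalUpdate(m=4000, n=4000, k=32)
print(choose_device(small))
print(choose_device(large))
```

In practice such a threshold would have to be tuned per machine, since it depends on the relative speed of the CPU, the GPU, and the transfer path between them.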

Bibliographic record

Source
Parallel Computing | 2011, No. 12 | pp. 759-770 | 12 pages
Authors
Chenhan D. Yu; Weichung Wang; Danl Pierce;
Author affiliations

Department of Mathematics, National Taiwan University, Taipei 10617, Taiwan;

Department of Mathematics, National Taiwan University, Taipei 10617, Taiwan;

MSC. Software Corporation, Glendale, CA 97203, USA;

Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language: English
Keywords
sparse and unsymmetric linear systems; multifrontal; CPU-GPU hybrid approach; parallel computing;

Similar literature

1. A Hybrid CPU-GPU Multifrontal Optimizing Method in Sparse Cholesky Factorization [J] . Chen Yong, Jin Hai, Zheng Ran, Journal of signal processing systems for signal, image, and video technology . 2018, No. 1

2. Performance models and workload distribution algorithms for optimizing a hybrid CPU-GPU multifrontal solver [J] . Chenhan D. Yu, Weichung Wang, Computers & mathematics with applications . 2014, No. 7

3. Algorithm 832: UMFPACK V4.3 - An Unsymmetric-Pattern Multifrontal Method [J] . Timothy A. Davis, ACM transactions on mathematical software . 2004, No. 2

4. Hybrid CPU-GPU Generation of the Hamiltonian and Overlap Matrices in FLAPW Methods [C] . Diego Fabregat-Traver, Davor Davidovic, Markus Hohnerbach, JARA High-Performance Computing Symposium . 2017

5. Refining Crystal Size Distributions and Kinetic Histories Using Automated Scanning Electron Microscopy and Manual Methods: A Hybrid Approach [D] . Cone, Kim A. 2018

6. Implementation of a Hybrid Educational Program between the Model of Personal and Social Responsibility (TPSR) and the Teaching Games for Understanding (TGfU) in Physical Education and Its Effects on Health: An Approach Based on Mixed Methods [O] . Gregorio García-Castejón, Oleguer Camerino, Marta Castañer, 2021

7. A combined unifrontal/multifrontal method for unsymmetric sparse matrices [O] . Davis, T A, Duff, I S 1997

8. Combined Unifrontal/Multifrontal Method for Unsymmetric Sparse Matrices [R] . Davis, T. A., Duff, I. S. 1997
