GPU-accelerated sparse matrices parallel inversion algorithm for large-scale power systems

Zhou Gan; Feng Yanjun; Bo Rui; Zhang Tao


Abstract

State-of-the-art Graphics Processing Units (GPUs) offer superior floating-point performance and memory bandwidth, and therefore have great potential in many computationally intensive power system applications, one of which is the inversion of large-scale sparse matrices. Inversion is a fundamental component of many power system analyses; it requires solving a massive number of forward and backward substitution (F&B) subtasks and appears to be a good candidate for GPU acceleration. By solving multiple F&B subtasks concurrently and applying a series of performance tunings tailored to the GPU architecture, we develop a batch F&B algorithm on GPUs that not only extracts the intra-level parallelism inside a single F&B subtask but also exploits a more regular parallelism among massive F&B subtasks, called inter-task parallelism. A case study on a 9241-dimension case shows that the proposed batch F&B solver takes 2.92 μs per forward substitution (FS) subtask when the batch size is 3072, a 65-fold speedup relative to the KLU library. On this basis, the complete design process of a GPU-based inversion algorithm is proposed. By offloading the heavy computational burden to the GPU, the inversion of the 9241-dimension case takes only 97 ms, an 8.1-fold speedup relative to a 12-core CPU inversion solver based on the KLU library. The proposed batch F&B solver is also promising for other power system applications that require solving massive numbers of F&B subtasks, such as probabilistic power flow analysis.
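The two kinds of parallelism named in the abstract can be illustrated with a small sketch. This is not the authors' GPU implementation: it is a sequential pure-Python illustration, assuming a sparse lower-triangular factor stored row by row as `(column, value)` pairs with the diagonal included. The function names `build_levels`, `batch_forward_substitution`, and `invert_lower_triangular` are hypothetical. The loops marked as independent are the ones a GPU version could map to threads; only forward substitution is covered, and backward substitution would mirror it on the upper-triangular factor.

```python
def build_levels(rows):
    """Group rows of a sparse lower-triangular matrix into dependency levels.

    rows[i] is a list of (col, value) pairs with col <= i, diagonal included
    (an assumed storage layout). Rows in one level do not depend on each other.
    """
    level_of = [0] * len(rows)
    for i, row in enumerate(rows):
        # A row sits one level above the deepest row it reads from.
        deps = [level_of[j] + 1 for j, _ in row if j != i]
        level_of[i] = max(deps, default=0)
    levels = [[] for _ in range(max(level_of, default=-1) + 1)]
    for i, lvl in enumerate(level_of):
        levels[lvl].append(i)
    return levels


def batch_forward_substitution(rows, levels, rhs_batch):
    """Solve L x = b for every right-hand side b in rhs_batch."""
    n = len(rows)
    xs = [[0.0] * n for _ in rhs_batch]
    for level in levels:  # levels must run one after another
        for i in level:  # rows within a level are independent (intra-level)
            diag = next(v for j, v in rows[i] if j == i)
            for t, b in enumerate(rhs_batch):  # tasks are independent (inter-task)
                acc = b[i]
                for j, v in rows[i]:
                    if j != i:
                        acc -= v * xs[t][j]
                xs[t][i] = acc / diag
    return xs


def invert_lower_triangular(rows):
    """Invert L by solving L x = e_c for every unit vector e_c in one batch."""
    n = len(rows)
    levels = build_levels(rows)
    identity = [[1.0 if r == c else 0.0 for r in range(n)] for c in range(n)]
    cols = batch_forward_substitution(rows, levels, identity)
    # cols[c] is column c of the inverse; transpose to row-major order.
    return [[cols[c][r] for c in range(n)] for r in range(n)]
```

Every right-hand side in the batch shares the same sparsity pattern and level schedule, which is why the inter-task loop is the more regular of the two: each task performs identical index arithmetic on different data.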


Bibliographic details

Source
International Journal of Electrical Power & Energy Systems | 2019, Issue 10 | pp. 34-43 | 10 pages
Authors
Zhou Gan; Feng Yanjun; Bo Rui; Zhang Tao
Author affiliations

Southeast Univ, Sch Elect Engn, Nanjing 210096, Jiangsu, Peoples R China;

Southeast Univ, Sch Elect Engn, Nanjing 210096, Jiangsu, Peoples R China;

Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA;

State Grid Anshan Elect Power Supply Co, Anshan 114000, Peoples R China;

Original format: PDF
Language: English
Keywords
Inversion; Forward substitution; Backward substitution; Sparse matrix; GPU; Accelerated; Parallelism; Power flow

