Performance of Parallel Sparse Matrix-Vector Multiplications in Linear Solves on Multiple GPUs

机译：多个GPU上线性求解中并行稀疏矩阵 - 矢量乘法的性能

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Modern numerical simulations often require solving extremely large sparse linear systems. Solving these linear systems using Krylov iterative methods requires repeated sparse matrix-vector multiplications which can be the most computationally expensive part of the simulation. Since Graphics Processing Units (GPUs) provide a significant increase in floating point operations per second and memory bandwidth over conventional Central Processing Units (CPUs), performing sparse matrix-vector multiplications with these co-processors can decrease the amount of time required to solve a given linear system. In this paper, we investigate the performance of sparse matrix-vector multiplications across multiple GPUs. This is performed in the context of the solution of symmetric positive-definite linear systems using a conjugate-gradient iteration preconditioned with a least-squares polynomial preconditioner using the PETSc library.

机译：现代数值模拟通常需要解决极大的稀疏线性系统。使用Krylov迭代方法求解这些线性系统需要重复的稀疏矩阵 - 矢量乘法，这可以是模拟最昂贵的部分。由于图形处理单元（GPU）在传统的中央处理单元（CPU）上提供浮点操作和内存带宽的浮点操作显着增加，并且利用这些协处理器执行稀疏矩阵矢量乘法可以降低求解A所需的时间量给定线性系统。在本文中，我们调查多个GPU跨越稀疏矩阵矢量乘法的性能。这在使用PETSC库的使用最小二乘多项式前提者的缀合物梯度迭代的对称正定线性系统的解决方案中执行。

著录项

来源
《Symposium on Application Accelerators in High Performance Computing》|2012年||共4页
会议地点
作者
Jamroz Ben; Mullowney Paul;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词

相似文献

外文文献
中文文献
专利

1. Hybrid-Parallel Sparse Matrix-Vector Multiplication and Iterative Linear Solvers with the communication library GPI [J] . Dimitar Stoyanov, Franz-Josef Pfreundt WSEAS Transactions on Information Science and Applications . 2014,第Null期

机译：带有通信库GPI的混合并行稀疏矩阵矢量乘法和迭代线性求解器
2. Performance Prediction Based on Statistics of Sparse Matrix-Vector Multiplication on GPUs [J] . Ruixing Wang, Tongxiang Gu, Ming Li Journal of Computer and Communications . 2017,第6期

机译：基于GPU稀疏矩阵矢量乘法统计的性能预测
3. Performance optimization of Sparse Matrix-Vector Multiplication for multi-component PDE-based applications using GPUs [J] . Abdelfattah Ahmad, Ltaief Hatem, Keyes David, Concurrency and computation: practice and experience . 2016,第12期

机译：使用GPU对基于PDE的多组件应用的稀疏矩阵矢量乘法的性能优化
4. Performance of Parallel Sparse Matrix-Vector Multiplications in Linear Solves on Multiple GPUs [C] . Jamroz Ben, Mullowney Paul 2012 Symposium on Application Accelerators in High Performance Computing. . 2012

机译：多个GPU上线性求解中并行稀疏矩阵矢量乘法的性能
5. Analysis of High Performance Sparse Matrix-Vector Multiplication for Small Finite Fields [D] . Lambert, Matthew A. 2020

机译：小型有限字段高性能稀疏矩阵矢量乘法分析
6. Parallelized pairwise sequence alignment using CUDA on multiple GPUs [O] . Sungbo Jung 2009

机译：在多个GPU上使用CUDA进行并行的成对序列比对
7. Performance Analysis of Sparse Matrix-Vector Multiplication (SpMV) on Graphics Processing Units (GPUs) [O] . Sarah AlAhmadi, Thaha Mohammed, Aiiad Albeshri, 2020

机译：稀疏矩阵矢量乘法（SPMV）对图形处理单元（GPU）的性能分析

Performance of Parallel Sparse Matrix-Vector Multiplications in Linear Solves on Multiple GPUs

摘要

著录项

相似文献

相关主题

期刊订阅