International Journal of High Performance Computing Applications

Optimization of quasi-diagonal matrix-vector multiplication on GPU


Abstract

Sparse matrix-vector multiplication (SpMV) is a central operation in sparse linear algebra and an important kernel in scientific computing and engineering practice. Much effort has been put into accelerating SpMV, and several parallel solutions have been proposed. This paper focuses on a special type of SpMV, namely sparse quasi-diagonal matrix-vector multiplication (SQDMV). Sparse quasi-diagonal matrices are key to solving many differential equations, yet very little research has addressed them. This paper discusses data structures and algorithms for SQDMV that are efficiently implemented on the compute unified device architecture (CUDA) platform for the fine-grained parallel architecture of the graphics processing unit (GPU). A new storage format, HDC, a hybrid of the diagonal format (DIA) and the compressed sparse row format (CSR), is presented; it overcomes the inefficiency of DIA in storing irregular matrices and the load imbalance of CSR when non-zero elements are unevenly distributed across rows. Furthermore, HDC can adjust the bandwidth of the stored diagonal band to match how scattered the non-zeros of a sparse matrix are, achieving a higher compression ratio than DIA and CSR and reducing computational cost. Our GPU implementation shows that HDC outperforms the other formats, especially for matrices with scattered non-zeros outside the main diagonal band. In addition, we combine the different parts of HDC into a unified kernel, yielding a better compression ratio and a higher speedup on the GPU.
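The abstract does not spell out the HDC layout or kernel, so the following is a minimal CUDA sketch of the general idea it describes: store the dense diagonal band in the usual DIA layout (one column-major slab per stored diagonal) and the stray off-band non-zeros in a CSR remainder, then fuse both parts into a single kernel. All identifiers (hdc_spmv, dia_offsets, csr_row_ptr, and so on) are illustrative assumptions, not the paper's actual interface.

// Hypothetical sketch of a fused DIA + CSR ("HDC"-style) SpMV kernel.
// y = A*x, where A is split into a DIA band plus a CSR remainder.
#include <cuda_runtime.h>

__global__ void hdc_spmv(int n,                    // matrix dimension
                         int num_diags,            // diagonals kept in the DIA part
                         const int   *dia_offsets, // offset of each stored diagonal
                         const float *dia_vals,    // n * num_diags, one slab per diagonal
                         const int   *csr_row_ptr, // CSR remainder: row pointers (n+1)
                         const int   *csr_cols,    // CSR remainder: column indices
                         const float *csr_vals,    // CSR remainder: values
                         const float *x,
                         float *y)
{
    int row = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= n) return;

    float sum = 0.0f;

    // DIA part: regular structure, so consecutive threads (rows) read
    // consecutive addresses of each diagonal slab -> coalesced accesses.
    for (int d = 0; d < num_diags; ++d) {
        int col = row + dia_offsets[d];
        if (col >= 0 && col < n)
            sum += dia_vals[d * n + row] * x[col];
    }

    // CSR part: the few scattered non-zeros outside the band.
    for (int j = csr_row_ptr[row]; j < csr_row_ptr[row + 1]; ++j)
        sum += csr_vals[j] * x[csr_cols[j]];

    y[row] = sum;
}

With a one-thread-per-row mapping (e.g. launched as hdc_spmv<<<(n + 255) / 256, 256>>>(...)), the DIA loop stays coalesced while the short CSR loop absorbs the irregular outliers, and fusing both parts in one kernel avoids a second pass over y, which is consistent with the unified-kernel design the abstract mentions.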
