Two-dimensional cache-oblivious sparse matrix-vector multiplication

A.N. Yzelman; Rob H. Bisseling


Abstract

In earlier work, we presented a one-dimensional cache-oblivious sparse matrix-vector (SpMV) multiplication scheme which has its roots in one-dimensional sparse matrix partitioning. Partitioning is often used in distributed-memory parallel computing for the SpMV multiplication, an important kernel in many applications. A logical extension is to move towards using a two-dimensional partitioning. In this paper, we present our research in this direction, extending the one-dimensional method for cache-oblivious SpMV multiplication to two dimensions, while still allowing only row and column permutations on the sparse input matrix. This extension requires a generalisation of the compressed row storage data structure to a block-based data structure, for which several variants are investigated. Experiments performed on three different architectures show further improvements of the two-dimensional method compared to the one-dimensional method, especially in those cases where the one-dimensional method already provided significant gains. The largest gain obtained by our new reordering is over a factor of 3 in SpMV speed, compared to the natural matrix ordering.
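To make the data-structure remark concrete, the following is a minimal sketch of SpMV multiplication with compressed row storage (CRS), followed by one possible block-based variant in which each block is stored in its own CRS with local indices. The helper names (`to_crs`, `crs_spmv`, `blocked_spmv`), the fixed block boundaries, and the small test matrix are assumptions made for illustration only. The abstract does not specify which block-based variants the authors investigate, so this should not be read as their data structure or their reordering method.

```python
def to_crs(dense):
    """Convert a dense list-of-lists matrix to CRS arrays (row_ptr, col_idx, vals)."""
    row_ptr, col_idx, vals = [0], [], []
    for row in dense:
        for j, a in enumerate(row):
            if a != 0:
                col_idx.append(j)
                vals.append(a)
        row_ptr.append(len(vals))  # end of this row's nonzeros
    return row_ptr, col_idx, vals


def crs_spmv(row_ptr, col_idx, vals, x):
    """Compute y = A x for a matrix A stored in CRS."""
    y = [0.0] * (len(row_ptr) - 1)
    for i in range(len(y)):
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += vals[k] * x[col_idx[k]]
    return y


def blocked_spmv(dense, row_cuts, col_cuts, x):
    """Hypothetical block-based variant: process the matrix block by block.

    row_cuts and col_cuts are assumed block boundaries, e.g. [0, 2, 4].
    Each block is converted to its own CRS with local indices, so one block
    touches only a contiguous slice of x and of y.
    """
    y = [0.0] * len(dense)
    for r0, r1 in zip(row_cuts[:-1], row_cuts[1:]):
        for c0, c1 in zip(col_cuts[:-1], col_cuts[1:]):
            block = [row[c0:c1] for row in dense[r0:r1]]
            yb = crs_spmv(*to_crs(block), x[c0:c1])
            for i, v in enumerate(yb):
                y[r0 + i] += v
    return y


# Illustrative 4x4 sparse matrix (not taken from the paper).
A = [[4, 0, 0, 1],
     [0, 3, 0, 0],
     [2, 0, 5, 0],
     [0, 0, 0, 6]]
x = [1.0, 2.0, 3.0, 4.0]

y_crs = crs_spmv(*to_crs(A), x)
y_blk = blocked_spmv(A, [0, 2, 4], [0, 2, 4], x)
```

Both routines compute the same product. The blocked version only changes the order in which nonzeros are visited, which is the kind of freedom a block-based storage scheme can exploit for cache reuse. A practical implementation would build the per-block CRS arrays once, not on every multiplication.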


Bibliographic record

Source
Parallel Computing, 2011, Issue 12, pp. 806-819 (14 pages)
Authors
A.N. Yzelman; Rob H. Bisseling
Affiliation

Mathematical Institute, Utrecht University, P.O. Box 80010, 3508 TA Utrecht, The Netherlands (both authors)
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Keywords
matrix-vector multiplication; sparse matrix; parallel computing; recursive bipartitioning; fine-grain; cache-oblivious;


