首页> 外文OA文献 >Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods

【2h】

Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods

机译：通过使用稀疏矩阵划分方法的高速缓存可忽略的稀疏矩阵矢量乘法

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this article, we introduce a cache-oblivious method for sparse matrix–vector multiplication. Our method attempts to permute the rows and columns of the input matrix using a recursive hypergraph-based sparse matrix partitioning scheme so that the resulting matrix induces cache-friendly behavior during sparse matrix–vector multiplication. Matrices are assumed to be stored in row-major format, by means of the compressed row storage (CRS) or its variants incremental CRS and zig-zag CRS. The zig-zag CRS data structure is shown to fit well with the hypergraph metric used in partitioning sparse matrices for the purpose of parallel computation. The separated block-diagonal (SBD) form is shown to be the appropriate matrix structure for cache enhancement.We have implemented a run-time cache simulation library enabling us to analyze cache behavior for arbitrary matrices and arbitrary cache properties during matrix–vector multiplication within a k-way set-associative idealized cache model. The results of these simulations are then verified by actual experiments run on various cache architectures. In all these experiments, we use the Mondriaan sparse matrix partitioner in one-dimensional mode. The savings in computation time achieved by our matrix reorderings reach up to 50 percent, in the case of a large link matrix.

机译：在本文中，我们介绍了一种用于稀疏矩阵-矢量乘法的缓存忽略方法。我们的方法尝试使用基于递归超图的稀疏矩阵分区方案来置换输入矩阵的行和列，以使生成的矩阵在稀疏矩阵-矢量乘法期间引起高速缓存友好行为。假定矩阵通过压缩行存储（CRS）或其变体增量CRS和之字形CRS以行优先格式存储。所示之字形CRS数据结构与用于并行计算的稀疏矩阵划分中使用的超图度量非常吻合。分离的块对角线（SBD）形式被证明是用于高速缓存增强的合适矩阵结构。我们已经实现了运行时高速缓存仿真库，使我们能够在矩阵向量乘法期间分析任意矩阵和任意高速缓存属性的高速缓存行为。 k路径集关联的理想化缓存模型。然后，通过在各种缓存体系结构上运行的实际实验来验证这些模拟的结果。在所有这些实验中，我们以一维模式使用Mondriaan稀疏矩阵分配器。在大型链接矩阵的情况下，通过矩阵重新排序可以节省高达50％的计算时间。

著录项

作者
Yzelman Albert-Jan; Bisseling Rob H.;
展开▼
作者单位

展开▼
年度 2009
总页数
原文格式 PDF
正文语种 en
中图分类

相似文献

外文文献
中文文献
专利

1. Cache-Oblivious Sparse Matrix--Vector Multiplication by Using Sparse Matrix Partitioning Methods [J] . A. N. Yzelman., Rob H. Bisseling. SIAM Journal on Scientific Computing . 2010,第4期

机译：高速缓存不可忽略的稀疏矩阵-使用稀疏矩阵划分方法的矢量乘法
2. Two-dimensional cache-oblivious sparse matrix-vector multiplication [J] . A.N. Yzelman, Rob H. Bisseling Parallel Computing . 2011,第12期

机译：二维高速缓存可忽略的稀疏矩阵矢量乘法
3. Hypergraph partitioning based models and methods for exploiting cache locality in sparse matrix-vector multiplication [J] . Akbudak K., Kayaaslan E., Aykanat C. SIAM Journal on Scientific Computing . 2013,第3期

机译：基于超图分区的稀疏矩阵矢量乘法中利用缓存局部性的模型和方法
4. A nested dissection partitioning method for parallel sparse matrix-vector multiplication [C] . Boman Erik G., Wolf Michael M. IEEE Conference on High Performance Extreme Computing . 2013

机译：并行稀疏矩阵-向量乘法的嵌套解剖分割方法
5. Analysis of High Performance Sparse Matrix-Vector Multiplication for Small Finite Fields [D] . Lambert, Matthew A. 2020

机译：小型有限字段高性能稀疏矩阵矢量乘法分析
6. A unified framework for sparse non-negative least squares using multiplicative updates and the non-negative matrix factorization problem [O] . Igor Fedorov, Alican Nalci, Ritwik Giri, -1

机译：使用乘法更新和非负矩阵分解问题的稀疏非负最小二乘的统一框架
7. Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods [O] . A. N. Yzelman, Rob, H. Bisseling 2009

机译：使用稀疏矩阵分割方法进行高速缓存遗忘稀疏矩阵向量乘法

Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅