首页> 外文期刊>Parallel Computing >Two-dimensional cache-oblivious sparse matrix-vector multiplication
【24h】

Two-dimensional cache-oblivious sparse matrix-vector multiplication

机译:二维高速缓存可忽略的稀疏矩阵矢量乘法

获取原文
获取原文并翻译 | 示例
           

摘要

In earlier work, we presented a one-dimensional cache-oblivious sparse matrix-vector (SpMV) multiplication scheme which has its roots in one-dimensional sparse matrix partitioning. Partitioning is often used in distributed-memory parallel computing for the SpMV multiplication, an important kernel in many applications. A logical extension is to move towards using a two-dimensional partitioning. In this paper, we present our research in this direction, extending the one-dimensional method for cache-oblivious SpMV multiplication to two dimensions, while still allowing only row and column permutations on the sparse input matrix. This extension requires a generalisation of the compressed row storage data structure to a block-based data structure, for which several variants are investigated. Experiments performed on three different architectures show further improvements of the two-dimensional method compared to the one-dimensional method, especially in those cases where the one-dimensional method already provided significant gains. The largest gain obtained by our new reordering is over a factor of 3 in SpMV speed, compared to the natural matrix ordering.
机译:在较早的工作中,我们提出了一维高速缓存可忽略的稀疏矩阵矢量(SpMV)乘法方案,该方案起源于一维稀疏矩阵分区。分区通常在SpMV乘法的分布式内存并行计算中使用,SpMV乘法是许多应用程序中的重要内核。逻辑扩展是朝着使用二维分区的方向发展。在本文中,我们在此方向上介绍了我们的研究,将一维方法用于不让高速缓存使用的SpMV乘法扩展到了二维,同时仍然只允许稀疏输入矩阵上的行和列排列。此扩展要求将压缩的行存储数据结构推广到基于块的数据结构,为此将研究几种变体。在三种不同体系结构上进行的实验表明,与一维方法相比,二维方法有了进一步的改进,尤其是在一维方法已经提供了可观收益的情况下。与自然矩阵排序相比,我们新的重新排序获得的最大增益是SpMV速度的3倍以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号