首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications
【24h】

Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications

机译:用于图形应用的GPU上的快速稀疏矩阵-向量乘法

获取原文

摘要

Sparse matrix-vector multiplication (SpMV) is a widely used computational kernel. The most commonly used format for a sparse matrix is CSR (Compressed Sparse Row), but a number of other representations have recently been developed that achieve higher SpMV performance. However, the alternative representations typically impose a significant preprocessing overhead. While a high preprocessing overhead can be amortized for applications requiring many iterative invocations of SpMV that use the same matrix, it is not always feasible -- for instance when analyzing large dynamically evolving graphs. This paper presents ACSR, an adaptive SpMV algorithm that uses the standard CSR format but reduces thread divergence by combining rows into groups (bins) which have a similar number of non-zero elements. Further, for rows in bins that span a wide range of non zero counts, dynamic parallelism is leveraged. A significant benefit of ACSR over other proposed SpMV approaches is that it works directly with the standard CSR format, and thus avoids significant preprocessing overheads. A CUDA implementation of ACSR is shown to outperform SpMV implementations in the NVIDIA CUSP and cuSPARSE libraries on a set of sparse matrices representing power-law graphs. We also demonstrate the use of ACSR for the analysis of dynamic graphs, where the improvement over extant approaches is even higher.
机译:稀疏矩阵向量乘法(SpMV)是一种广泛使用的计算内核。稀疏矩阵最常用的格式是CSR(压缩稀疏行),但是最近开发了许多其他表示形式,可以实现更高的SpMV性能。但是,替代表示通常会带来很大的预处理开销。对于需要使用相同矩阵进行SpMV的多次迭代调用的应用程序,可以分摊高昂的预处理开销,但这并不总是可行的-例如,在分析大型动态演化图时。本文介绍了ACSR,这是一种自适应SpMV算法,它使用标准CSR格式,但通过将行合并为具有相似数量的非零元素的组(bin)来减少线程发散。此外,对于跨非零计数范围广泛的bin中的行,利用了动态并行性。与其他提议的SpMV方法相比,ACSR的显着优势是它可以直接与标准CSR格式一起使用,从而避免了巨大的预处理开销。在一组表示幂律图的稀疏矩阵上,显示出ACSR的CUDA实现优于NVIDIA CUSP和cuSPARSE库中的SpMV实现。我们还演示了使用ACSR进行动态图分析,与现有方法相比,该方法的改进甚至更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号