【24h】

Dynamic Sparse-Matrix Allocation on GPUs

机译:GPU上的动态稀疏矩阵分配

获取原文

摘要

Sparse matrices are a core component in many numerical simulations, and their efficiency is essential to achieving high performance. Dynamic sparse-matrix allocation (insertion) can benefit a number of problems such as sparse-matrix factorization, sparse-matrix-matrix addition, static analysis (e.g., points-to analysis), computing transitive closure, and other graph algorithms. Existing sparse-matrix formats are poorly designed to handle dynamic updates. The compressed sparse-row (CSR) format is fully compact and must be rebuilt after each new entry. Ellpack (ELL) stores a constant number of entries per row, which allows for efficient insertion and sparse matrix-vector multiplication (SpMV) but is memory inefficient and strictly limits row size. The coordinate (COO) format stores a list of entries and is efficient for both memory use and insertion time; however, it is much less efficient at SpMV. Hybrid ellpack (HYB) compromises by using a combination of ELL and COO but degrades in performance as the COO portion fills up. Rows that use the COO portion require it to be completely traversed during every SpMV operation. In this paper we introduce a new sparse matrix format, dynamic compressed sparse row (DCSR), that permits efficient dynamic updates. These updates are significantly faster than those made to a HYB matrix while maintaining SpMV times comparable to CSR. We demonstrate the efficacy of our dynamic allocation scheme, evaluating updates and SpMV operations on adjacency matrices of sparse-graph benchmarks on the GPU.
机译:稀疏矩阵是许多数值模拟的核心组成部分,它们的效率对于实现高性能至关重要。动态稀疏矩阵分配(插入)可以使许多问题受益,例如稀疏矩阵分解,稀疏矩阵加法,静态分析(例如指向分析),计算传递闭包以及其他图算法。现有的稀疏矩阵格式设计得很差,无法处理动态更新。压缩的稀疏行(CSR)格式是完全紧凑的,必须在每次添加新条目后重新构建。 Ellpack(ELL)每行存储恒定数量的条目,这可以实现高效插入和稀疏矩阵矢量乘法(SpMV),但内存效率低下,并严格限制了行大小。坐标(COO)格式存储条目列表,并且对于内存使用和插入时间都很有效;但是,它在SpMV上的效率要低得多。混合ellpack(HYB)通过结合使用ELL和COO进行折衷,但随着COO部分填满,性能会降低。使用COO部分的行要求在每次SpMV操作期间都将其完全遍历。在本文中,我们介绍了一种新的稀疏矩阵格式,即动态压缩的稀疏行(DCSR),它可以实现有效的动态更新。这些更新速度明显快于HYB矩阵,同时保持SpMV时间可与CSR相提并论。我们演示了动态分配方案的功效,它在GPU上基于稀疏图基准的邻接矩阵评估更新和SpMV操作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号