首页> 外文会议>2017 IEEE 24th International Conference on High Performance Computing >Characterization of Data Movement Requirements for Sparse Matrix Computations on GPUs
【24h】

Characterization of Data Movement Requirements for Sparse Matrix Computations on GPUs

机译:GPU上稀疏矩阵计算的数据移动要求的表征

获取原文
获取原文并翻译 | 示例

摘要

Tight data movement lower bounds are known for dense matrix-vector multiplication and dense matrix-matrix multiplication and practical implementations exist on GPUs that achieve performance quite close to the roofline bounds based on operational intensity. For large dense matrices, matrix-vector multiplication is bandwidth-limited and its performance is significantly lower than matrix-matrix multiplication. However, in contrast, the performance of sparse matrix-matrix multiplication (SpGEMM) is generally much lower than that of sparse matrix-vector multiplication (SpMV). In this paper, we use a combination of lower-bounds and upper-bounds analysis of data movement requirements, as well as hardware counter based measurements to gain insights into the performance limitations of existing implementations for SpGEMM on GPUs. The analysis motivates the development of an adaptive work distribution strategy among threads and results in performance enhancement for SpGEMM code on GPUs.
机译:密集数据矩阵乘法和密集矩阵矩阵乘法的严格数据移动下限是众所周知的,并且在GPU上存在一些实际实现方式,这些实现方式的性能非常接近基于操作强度的车顶界限。对于大密度矩阵,矩阵矢量乘法受到带宽限制,其性能明显低于矩阵矩阵乘法。但是,相比之下,稀疏矩阵-矩阵乘法(SpGEMM)的性能通常要比稀疏矩阵-矢量乘法(SpMV)的性能低得多。在本文中,我们结合了数据移动需求的上下限分析以及基于硬件计数器的度量,以深入了解SpGEMM在GPU上现有实现的性能局限性。该分析激励了线程之间的自适应工作分配策略的发展,并提高了GPU上SpGEMM代码的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号