Sparse Linear Algebra on AMD and NVIDIA GPUs-The Race Is On

Abstract

Efficiently processing sparse matrices is a central and performance-critical part of many scientific simulation codes. Recognizing the adoption of manycore accelerators in HPC, we evaluate in this paper the performance of the currently best sparse matrix-vector product (SpMV) implementations on high-end GPUs from AMD and NVIDIA. Specifically, we optimize SpMV kernels for the CSR, COO, ELL, and HYB formats, taking the hardware characteristics of the latest GPU technologies into account. For 2,800 test matrices, we compare the performance of our kernels against AMD's hipSPARSE library and NVIDIA's cuSPARSE library, and ultimately assess how the GPU technologies from AMD and NVIDIA compare in terms of SpMV performance.
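
For readers unfamiliar with the storage formats named in the abstract, the following is a minimal sequential sketch of SpMV for the CSR and COO formats. It is only an illustration of the data layouts, not the optimized GPU kernels evaluated in the paper; the function names `spmv_csr` and `spmv_coo` and the small example matrix are assumptions introduced here.

```python
def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A @ x for A stored in CSR (compressed sparse row) form.

    row_ptr[i]..row_ptr[i+1] delimits the nonzeros of row i inside
    col_idx (column indices) and values (nonzero entries).
    """
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for row in range(n_rows):
        acc = 0.0
        for k in range(row_ptr[row], row_ptr[row + 1]):
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


def spmv_coo(row_idx, col_idx, values, x, n_rows):
    """Compute y = A @ x for A stored in COO (coordinate) form.

    Each nonzero is an explicit (row, column, value) triple.
    """
    y = [0.0] * n_rows
    for r, c, v in zip(row_idx, col_idx, values):
        y[r] += v * x[c]
    return y


# Hypothetical example matrix: A = [[4, 0, 1], [0, 3, 0], [2, 0, 5]]
values = [4.0, 1.0, 3.0, 2.0, 5.0]
col_idx = [0, 2, 1, 0, 2]
row_ptr = [0, 2, 3, 5]         # CSR row offsets
row_idx = [0, 0, 1, 2, 2]      # COO row indices
x = [1.0, 2.0, 3.0]

y_csr = spmv_csr(row_ptr, col_idx, values, x)
y_coo = spmv_coo(row_idx, col_idx, values, x, n_rows=3)
```

Both functions produce the same result vector; the formats differ only in how row membership is encoded, which is what makes their parallel performance characteristics differ on GPUs.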

Bibliographic details

Source
International Conference ISC High Performance: International Conference on High Performance Computing | 2020 | pp. 309-327 | 19 pages
Authors
Yuhsiang M. Tsai; Terry Cojean; Hartwig Anzt
Original format: PDF
Keywords
Sparse matrix vector product (SpMV); GPUs; AMD; NVIDIA

