From Sparse Matrix to Optimal GPU CUDA Sparse Matrix Vector Product Implementation

机译：从稀疏矩阵到最佳GPU CUDA稀疏矩阵矢量乘积实现

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The CUDA model for GPUs presents the programmer with a plethora of different programming options. These includes different memory types, different memory access methods, and different data types. Identifying which options to use and when is a non-trivial exercise. This paper explores the effect of these different options on the performance of a routine that evaluates sparse matrix vector products. A process for analysing performance and selecting the subset of implementations that perform best is proposed. The potential for mapping sparse matrix attributes to optimal CUDA sparse matrix vector product implementation is discussed.

机译：用于GPU的CUDA模型为程序员提供了许多不同的编程选项。这些包括不同的内存类型，不同的内存访问方法和不同的数据类型。确定使用哪些选项以及何时使用是不平凡的练习。本文探讨了这些不同选项对评估稀疏矩阵向量乘积的例程的性能的影响。提出了一种用于分析性能并选择性能最佳的实现子集的过程。讨论了将稀疏矩阵属性映射到最佳CUDA稀疏矩阵矢量乘积实现的潜力。

著录项

来源
《10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing》|2010年|p.808-813|共6页
会议地点 Melbourne(AU);Melbourne(AU)
作者
El Zein Ahmed H.; Rendell Alistair P.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
GPU; Sparse Matrix; spmv;

机译：GPU;稀疏矩阵; spmv;

相似文献

外文文献
中文文献
专利

1. Generating optimal CUDA sparse matrix-vector product implementations for evolving GPU hardware [J] . Ahmed H. El Zein, Alistair P. Rendell Concurrency and Computation . 2012,第1期

机译：生成用于不断发展的GPU硬件的最佳CUDA稀疏矩阵矢量乘积实现
2. CUDA GPU libraries and novel sparse matrix-vector multiplication - implementation and performance enhancement in unstructured finite element computations [J] . Richard Haney, Ram Mohan International Journal of Computational Science and Engineering . 2019,第4期

机译：CUDA GPU库和新型稀疏矩阵 - 矢量乘法 - 非结构化有限元计算中的实现和性能增强
3. CUDA-enabled Sparse Matrix-Vector Multiplication on GPUs using atomic operations [J] . Hoang-Vu Dang, Bertil Schmidt Parallel Computing . 2013,第11期

机译：使用原子运算在GPU上启用CUDA的稀疏矩阵向量乘法
4. From Sparse Matrix to Optimal GPU CUDA Sparse Matrix Vector Product Implementation [C] . Ahmed H. El Zein, Alistair P. Rendell IEEE/ACM International Conference on Cluster, Cloud and Grid Computing . 2010

机译：从稀疏矩阵到最佳GPU CUDA稀疏矩阵矢量产品实现
5. Exploring the potential for accelerating sparse matrix-vector product on a Processing-in-Memory architecture [D] . Youssefi, Annahita 2009

机译：探索在内存中处理架构上加速稀疏矩阵矢量乘积的潜力
6. Computing the sparse matrix vector product using block-based kernels without zero padding on processors with AVX-512 instructions [O] . Bérenger Bramas, Pavel Kus 2018

机译：使用AVX-512指令的处理器上没有零填充的基于块的内核计算稀疏矩阵矢量产品
7. The Sliced COO Format for Sparse Matrix-Vector Multiplication on CUDA-enabled GPUs [O] . Dang Hoang-Vu, Schmidt Bertil 2012

机译：启用CUDA的GPU上稀疏矩阵向量乘法的切片COO格式

From Sparse Matrix to Optimal GPU CUDA Sparse Matrix Vector Product Implementation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅