BestSF A Sparse Meta-Format for Optimizing SpMV on GPU

Benatia Akrem; Ji Weixing; Wang Yizhuo; Shi Feng

首页> 外文期刊>ACM Transactions on Architecture and Code Optimization >BestSF A Sparse Meta-Format for Optimizing SpMV on GPU

【24h】

BestSF A Sparse Meta-Format for Optimizing SpMV on GPU

机译：最好的稀疏元格式，可以在GPU上优化SPMV

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Sparse Matrix-Vector Multiplication (SpMV) kernel dominates the computing cost in numerous scientific applications. Many implementations based on different sparse formats were proposed to improve this kernel on the recent GPU architectures. However, it has been widely observed that there is no "best-for-all" sparse format for the SpMV kernel on GPU. Indeed, serious performance degradation of an order of magnitude can be observed without a careful selection of the sparse format to use. To address this problem, we propose in this article BestSF (Best Sparse Format), a new learning-based sparse meta-format that automatically selects the most appropriate sparse format for a given input matrix. To do so, BestSF relies on a cost-sensitive classification system trained using Weighted Support Vector Machines (WSVMs) to predict the best sparse format for each input sparse matrix. Our experimental results on two different NVIDIA GPU architectures using a large number of real-world sparse matrices show that BestSF achieved a noticeable overall performance improvement over using a single sparse format. While BestSF is trained to select the best sparse format in terms of performance (GFLOPS), our further experimental investigations revealed that using BestSF also led, in most of the test cases, to the best energy efficiency (MFLOPS/W). To prove its practical effectiveness, we also evaluate the performance and energy efficiency improvement achieved when using BestSF as a building block in a GPU-based Preconditioned Conjugate Gradient (PCG) iterative solver.

机译：稀疏矩阵矢量乘法（SPMV）内核主导了许多科学应用中的计算成本。提出了基于不同稀疏格式的许多实现，以改进最近的GPU架构上的这个内核。然而，已经普遍观察到GPU上的SPMV内核没有“最适合所有”稀疏格式。实际上，可以观察到幅度的严重性能下降，而无需仔细选择要使用的稀疏格式。为了解决这个问题，我们提出了本文的Bestsf（最佳稀疏格式），一种新的基于学习的稀疏元格式，可自动为给定输入矩阵选择最合适的稀疏格式。为此，最好依赖于使用加权支持向量机（WSVM）训练的成本敏感的分类系统来预测每个输入稀疏矩阵的最佳稀疏格式。我们对两种不同的NVIDIA GPU架构的实验结果使用大量真实世界稀疏矩阵表明，最好使用单一稀疏格式实现明显的整体性能改进。虽然Bestsf受过培训以在性能（GFlops）方面选择最佳稀疏格式，但我们的进一步实验研究表明，在大多数测试用例中使用最佳LED也以最佳的能效（MFLOPS / W）。为了证明其实用效果，我们还评估了在基于GPU的预处理共轭梯度（PCG）迭代求解器中使用Bestsf作为构建块时实现的性能和能效改进。

著录项

来源
《ACM Transactions on Architecture and Code Optimization》 |2018年第3期|共27页
作者
Benatia Akrem; Ji Weixing; Wang Yizhuo; Shi Feng;
展开▼
作者单位

Beijing Inst Technol 5 South Zhongguancun St Beijing 100081 Peoples R China;

Beijing Inst Technol 5 South Zhongguancun St Beijing 100081 Peoples R China;

Beijing Inst Technol 5 South Zhongguancun St Beijing 100081 Peoples R China;

Beijing Inst Technol 5 South Zhongguancun St Beijing 100081 Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Sparse matrix-vector multiplication (SpMV); GPU computing; performance modeling; energy efficiency; iterative solvers;

机译：稀疏矩阵 - 矢量乘法（SPMV）;GPU计算;性能建模;能效;迭代求解器;

相似文献

外文文献
中文文献
专利

1. PELLR: A Permutated ELLPACK-R Format for SpMV on GPUs [J] . Zhiqi Wang, Tongxiang Gu 电脑和通信（英文） . 2020,第004期
2. BestSF A Sparse Meta-Format for Optimizing SpMV on GPU [J] . Benatia Akrem, Ji Weixing, Wang Yizhuo, ACM Transactions on Architecture and Code Optimization . 2018,第3期

机译：最好的稀疏元格式，可以在GPU上优化SPMV
3. Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms [J] . Benatia Akrem, Ji Weixing, Wang Yizhuo, Experimental Mechanics . 2020,第1期

机译：稀疏矩阵划分，用于在CPU-GPU异构平台上优化SpMV
4. SpMV and BiCG-Stab optimization for a class of hepta-diagonal-sparse matrices on GPU [J] . Al-Mouhamed Mayez A., Khan Ayaz H. Journal of supercomputing . 2017,第9期

机译：SpMV和BiCG-Stab优化针对GPU上的一类七对角稀疏矩阵
5. Optimizing SpMV for Diagonal Sparse Matrices on GPU [C] . Sun Xiangzheng, Zhang Yunquan, Wang Ting, 2011 International Conference on Parallel Processing . 2011

机译：在GPU上为对角稀疏矩阵优化SpMV
6. Developing a New Storage Format and a Warp-Based SpMV Kernel for Configuration Interaction Sparse Matrices on the GPU [D] . Mahmoud, Mohammed. 2018

机译：为GPU上的配置交互稀疏矩阵开发新的存储格式和基于Warp的SpMV内核
7. Next-generation acceleration and code optimization for light transport in turbid media using GPUs [O] . Erik Alerstam, William Chun Yip Lo, Tianyi David Han, 2010

机译：下一代加速和代码优化使用GPU在混浊的介质中传输
8. Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms [O] . Akrem Benatia, Weixing Ji, Yizhuo Wang, 2019

机译：用于优化CPU-GPU异构平台SPMV的稀疏矩阵分区

BestSF A Sparse Meta-Format for Optimizing SpMV on GPU

摘要

著录项

相似文献

相关主题

期刊订阅