A Performance Prediction and Analysis Integrated Framework for SpMV on GPUs

Ping Guo; Chung-Wei Lee; Chung-wei Lee

首页> 外文期刊>Procedia Computer Science >A Performance Prediction and Analysis Integrated Framework for SpMV on GPUs

【24h】

A Performance Prediction and Analysis Integrated Framework for SpMV on GPUs

机译：GPU上SpMV的性能预测和分析集成框架

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents unique modeling algorithms of performance prediction for sparse matrix-vector multiplication on GPUs. Based on the algorithms, we develop a framework that is able to predict SpMV kernel performance and to analyze the reported prediction results. We make the following contributions: (1) We provide theoretical basis for the generation of benchmark matrices according to the hardware features of a given specific GPU. (2) Given a sparse matrix, we propose a quantitative method to collect some features representing its matrix settings. (3) We propose four performance modeling algorithms to accurately predict kernel performance for SpMV computing using CSR, ELL, COO, and HYB SpMV kernels. We evaluate the accuracy of our framework with 8 widely-used sparse matrices (totally 32 test cases) on NVIDIA Tesla K80 GPU. In our experiments, the average performance differences between the predicted and measured SpMV kernel execution times for CSR, ELL, COO, and HYB SpMV kernels are 5 . 1%, 5 . 3%, 1 . 7%, and 6 . 1%, respectively.

机译：本文提出了用于GPU上稀疏矩阵矢量乘法的性能预测的独特建模算法。基于这些算法，我们开发了一个能够预测SpMV内核性能并分析报告的预测结果的框架。我们做出了以下贡献：（1）根据给定特定GPU的硬件功能，为基准矩阵的生成提供了理论基础。（2）给定一个稀疏矩阵，我们提出了一种定量方法来收集一些代表其矩阵设置的特征。（3）我们提出了四种性能建模算法，以使用CSR，ELL，COO和HYB SpMV内核准确预测SpMV计算的内核性能。我们在NVIDIA Tesla K80 GPU上使用8种广泛使用的稀疏矩阵（总共32个测试用例）评估了我们框架的准确性。在我们的实验中，CSR，ELL，COO和HYB SpMV内核的预测和测得的SpMV内核执行时间之间的平均性能差异为5。 1％，5。 3％，1。 7％，和6。分别为1％。

著录项

来源
《Procedia Computer Science》 |2016年第22期|共12页
作者
Ping Guo; Chung-Wei Lee; Chung-wei Lee;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Performance Analysis and Optimization for SpMV on GPU Using Probabilistic Modeling [J] . Li K., Yang W., Li K. Parallel and Distributed Systems, IEEE Transactions on . 2015,第1期

机译：基于概率建模的SpMV在GPU上的性能分析和优化
2. Performance Optimization Using Partitioned SpMV on GPUs and Multicore CPUs [J] . Yang Wangdong, Li Kenli, Mo Zeyao, Computers, IEEE Transactions on . 2015,第9期

机译：在GPU和多核CPU上使用分区SpMV进行性能优化
3. Accurate cross-architecture performance modeling for sparse matrix-vector multiplication (SpMV) on GPUs [J] . Ping Guo, Liqiang Wang Concurrency, practice and experience . 2015,第13期

机译：GPU上的稀疏矩阵矢量乘法（SpMV）的准确跨体系结构性能建模
4. Performance Prediction for CSR-Based SpMV on GPUs Using Machine Learning [C] . Ping Guo, Changjiang Zhang IEEE International Conference on Computer and Communications . 2018

机译：使用机器学习在GPU上基于CSR的SpMV的性能预测
5. High performance multiscale image processing framework on multi-GPUs (graphics processing units) with applications to unbiased diffeomorphic atlas construction. [D] . Ha, Linh Khanh. 2011

机译：多GPU（图形处理单元）上的高性能多尺度图像处理框架，可应用于无偏微晶图集构造。
6. Application Performance Analysis and Efficient Execution on Systems with multi-core CPUs GPUs and MICs: A Case Study with Microscopy Image Analysis [O] . George Teodoro, Tahsin Kurc, Guilherme Andrade, -1

机译：具有多核CPUGPU和MIC的系统上的应用程序性能分析和高效执行：以显微镜图像分析为例
7. A Performance Prediction and Analysis Integrated Framework for SpMV on GPUs [O] . Guo Ping, Lee Chung-wei 2016

机译：GPU上SpMV的性能预测和分析集成框架

A Performance Prediction and Analysis Integrated Framework for SpMV on GPUs

摘要

著录项

相似文献

相关主题

期刊订阅