...
首页> 外文期刊>Procedia Computer Science >A Performance Prediction and Analysis Integrated Framework for SpMV on GPUs
【24h】

A Performance Prediction and Analysis Integrated Framework for SpMV on GPUs

机译:GPU上SpMV的性能预测和分析集成框架

获取原文
   

获取外文期刊封面封底 >>

       

摘要

This paper presents unique modeling algorithms of performance prediction for sparse matrix-vector multiplication on GPUs. Based on the algorithms, we develop a framework that is able to predict SpMV kernel performance and to analyze the reported prediction results. We make the following contributions: (1) We provide theoretical basis for the generation of benchmark matrices according to the hardware features of a given specific GPU. (2) Given a sparse matrix, we propose a quantitative method to collect some features representing its matrix settings. (3) We propose four performance modeling algorithms to accurately predict kernel performance for SpMV computing using CSR, ELL, COO, and HYB SpMV kernels. We evaluate the accuracy of our framework with 8 widely-used sparse matrices (totally 32 test cases) on NVIDIA Tesla K80 GPU. In our experiments, the average performance differences between the predicted and measured SpMV kernel execution times for CSR, ELL, COO, and HYB SpMV kernels are 5 . 1%, 5 . 3%, 1 . 7%, and 6 . 1%, respectively.
机译:本文提出了用于GPU上稀疏矩阵矢量乘法的性能预测的独特建模算法。基于这些算法,我们开发了一个能够预测SpMV内核性能并分析报告的预测结果的框架。我们做出了以下贡献:(1)根据给定特定GPU的硬件功能,为基准矩阵的生成提供了理论基础。 (2)给定一个稀疏矩阵,我们提出了一种定量方法来收集一些代表其矩阵设置的特征。 (3)我们提出了四种性能建模算法,以使用CSR,ELL,COO和HYB SpMV内核准确预测SpMV计算的内核性能。我们在NVIDIA Tesla K80 GPU上使用8种广泛使用的稀疏矩阵(总共32个测试用例)评估了我们框架的准确性。在我们的实验中,CSR,ELL,COO和HYB SpMV内核的预测和测得的SpMV内核执行时间之间的平均性能差异为5。 1%,5。 3%,1。 7%,和6。分别为1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号