...
首页> 外文期刊>Parallel Computing >Automatic tuning of the sparse matrix vector product on GPUs based on the ELLR-T approach
【24h】

Automatic tuning of the sparse matrix vector product on GPUs based on the ELLR-T approach

机译:基于ELLR-T方法在GPU上自动调整稀疏矩阵矢量积

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

A wide range of applications in engineering and scientific computing are involved in the acceleration of the sparse matrix vector product (SpMV). Graphics Processing Units (GPUs) have recently emerged as platforms that yield outstanding acceleration factors. SpMV implementations for GPUs have already appeared on the scene. This work is focused on the ELLR-T algorithm to compute SpMV on GPU architecture, its performance is strongly dependent on the optimum selection of two parameters. Therefore, taking account that the memory operations dominate the performance of ELLR-T, an analytical model is proposed in order to obtain the auto-tuning of ELLR-T for particular combinations of sparse matrix and GPU architecture. The evaluation results with a representative set of test matrices show that the average performance achieved by auto-tuned ELLR-T by means of the proposed model is near to the optimum. A comparative analysis of ELLR-T against a variety of previous proposals shows that ELLR-T with the estimated configuration reaches the best performance on GPU architecture for the representative set of test matrices.
机译:稀疏矩阵矢量积(SpMV)的加速涉及工程和科学计算中的广泛应用。图形处理单元(GPU)最近已经成为可产生出色加速因子的平台。用于GPU的SpMV实现已经出现在现场。这项工作的重点是在GPU架构上计算SpMV的ELLR-T算法,其性能强烈取决于两个参数的最佳选择。因此,考虑到存储器操作支配ELLR-T的性能,提出了一种解析模型,以便针对稀疏矩阵和GPU架构的特定组合获得ELLR-T的自动调整。一组代表性测试矩阵的评估结果表明,通过提出的模型自动调整的ELLR-T所实现的平均性能接近最佳水平。对ELLR-T与各种先前建议的比较分析表明,对于代表性的测试矩阵集,具有估计配置的ELLR-T在GPU架构上达到了最佳性能。

著录项

  • 来源
    《Parallel Computing》 |2012年第8期|408-420|共13页
  • 作者单位

    Almeria University, Dpt Computer Architecture and Electronics, Ctra San Urbano s Canada, 04120 Almeria, Spain;

    Almeria University, Dpt Computer Architecture and Electronics, Ctra San Urbano s Canada, 04120 Almeria, Spain,Centra Nacional de Biotecnologia (CNB-CSIC), Darwin 3, Campus de Cantoblanco, 28049 Madrid, Spain;

    Almeria University, Dpt Computer Architecture and Electronics, Ctra San Urbano s Canada, 04120 Almeria, Spain;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Sparse matrix vector product; GPU computing; GPU performance modeling;

    机译:稀疏矩阵向量积;GPU计算;GPU性能建模;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号