【24h】

Predicting Execution Time of CUDA Kernel Using Static Analysis

机译:使用静态分析预测CUDA内核的执行时间

获取原文
获取原文并翻译 | 示例

摘要

With the growing demand for performance-oriented problems, programmers routinely execute the embarrassing parallel part of the application (GPU kernels) in a GPU in order to achieve signi?cant speedup. These applications are becoming complex and long-running which makes it energy inef?cient. Anticipating its execution time can help the developers to ?x the inef?cient code before running it. In this paper, we propose an approach to predict the execution time of a GPU kernel without the need of executing it. We build an analytical model to predict the execution time of a GPU kernel by analyzing the intermediate PTX code of a CUDA kernel. Our experimental analysis of a set of benchmarks shows that for 45 applications the estimated execution time has the mean absolute error of 26.86% when compared to the actual execution time. Mean absolute error for benchmarks belonging to Dynamic programming dwarf is minimum, followed by Dense Linear Algebra benchmarks.
机译:随着对性能相关问题的需求日益增长,程序员通常在GPU中执行令人尴尬的并行应用程序部分(GPU内核),以实现显着的加速。这些应用程序变得越来越复杂且需要长期运行,这使其能源效率低下。预期其执行时间可以帮助开发人员在运行无效代码之前对其进行修复。在本文中,我们提出了一种无需执行GPU内核即可预测GPU内核执行时间的方法。我们通过分析CUDA内核的中间PTX代码,构建了一个分析模型来预测GPU内核的执行时间。我们对一组基准的实验分析表明,对于45个应用程序,与实际执行时间相比,估计执行时间的平均绝对误差为26.86%。属于动态编程侏儒的基准的平均绝对误差最小,其次是密集线性代数基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号