Published in: International Conference on High Performance Computing, Data, and Analytics

GPU-FPtuner: Mixed-precision Auto-tuning for Floating-point Applications on GPU



Abstract

GPUs have been extensively used to accelerate scientific applications from a variety of domains: computational fluid dynamics, astronomy and astrophysics, climate modeling, and numerical analysis, to name a few. Many of these applications rely on floating-point arithmetic, which is approximate in nature. High-precision libraries have been proposed to mitigate accuracy issues due to the use of floating-point arithmetic. However, these libraries offer increased accuracy at a significant performance cost. Previous work, primarily focusing on CPU code and on standard IEEE floating-point data types, has explored mixed precision as a compromise between performance and accuracy. In this work, we propose a mixed-precision autotuner for GPU applications that rely on floating-point arithmetic. Our tool supports standard 32- and 64-bit floating-point arithmetic, as well as high precision through the QD library. Our autotuner relies on compiler analysis to reduce the size of the tuning space. In particular, our tuning strategy takes into account code patterns prone to error propagation, along with GPU-specific considerations, to generate a tuning plan that balances performance and accuracy. Our autotuner pipeline, implemented using the ROSE compiler and Python scripts, is fully automated, and the code is released as open source. Our experimental results, collected on benchmark applications of varying code complexity, show the performance-accuracy tradeoffs for these applications and the effectiveness of our tool in identifying representative tuning points.
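The accuracy cost that motivates mixed-precision tuning can be seen in a toy reduction. The sketch below is not from the paper's tool; it simulates a 32-bit accumulator in plain Python (rounding through `struct` after every addition, as a hypothetical stand-in for a `float`-typed GPU reduction) and compares it against a full 64-bit accumulation:

```python
import struct

def to_f32(x):
    """Round an IEEE double to the nearest IEEE single (binary32)."""
    return struct.unpack("f", struct.pack("f", x))[0]

def reduce_f32(values):
    """Sum with the accumulator rounded to 32 bits after every addition,
    mimicking a float-typed reduction kernel (hypothetical stand-in)."""
    acc = 0.0
    for v in values:
        acc = to_f32(acc + to_f32(v))
    return acc

def reduce_f64(values):
    """The same reduction carried out entirely in 64-bit precision."""
    acc = 0.0
    for v in values:
        acc += v
    return acc

n = 1_000_000
values = [0.1] * n  # exact sum is 100000
err32 = abs(reduce_f32(values) - 100000.0)
err64 = abs(reduce_f64(values) - 100000.0)
print(f"float32 accumulator error: {err32}")
print(f"float64 accumulator error: {err64}")
```

Because 0.1 is not exactly representable and each 32-bit add rounds the growing accumulator, the single-precision error dwarfs the double-precision one. A mixed-precision autotuner exploits the converse: many intermediate values tolerate the cheaper 32-bit format, so precision can be lowered selectively where error does not propagate into the final result.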
