POSTER: GPUs Pipeline Latency Analysis

机译：海报：GPU管线延迟分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this work, we propose a very low overhead and portable analysis for exposing the hidden latency of each individual instruction executing in the pipeline and different access latencies of the various memory hierarchies at the microarchitecture level. We also show the impact of the possible optimizations a CUDA compiler have over the various latencies. We run our evaluation on seven different high-end NVIDIA GPUs from five different generations/architectures namely: Kepler, Maxwell, Pascal, Volta, and Turing.

机译：在这项工作中，我们提出了一个非常低的开销和可移植性分析，以揭示在微体系结构级别在管道中执行的每条指令的隐藏等待时间以及各种内存层次结构的不同访问等待时间。我们还展示了CUDA编译器可能进行的优化对各种延迟的影响。我们对来自五种不同的代/体系结构的七个不同的高端NVIDIA GPU进行了评估，它们分别是：开普勒，麦克斯韦，帕斯卡，沃尔特和图灵。

著录项

来源
《International Conference on Application-specific Systems, Architectures and Processors 》|2019年|139-139|共1页
会议地点
作者
Yehia Arafa; Abdel-Hameed A. Badawy; Gopinath Chennupati; Nandakishore Santhi; Stephan Eidenbenz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Graphics processing units; Computer architecture; Hardware; Pipelines; Optimization; Instruction sets; Programming;

机译：图形处理单元;计算机体系结构;硬件;管道;优化;指令集;编程;

相似文献

外文文献
中文文献
专利

1. GPU-acceleration on a low-latency binary-coalescence gravitational wave search pipeline [J] . Guo Xiangyu, Chu Qi, Chung Shin Kee, Computer physics communications . 2018 ,第期

机译：低延迟二进制聚结重重力波搜索管道的GPU加速
2. POSTER: Performance Modeling for GPUs using Abstract Kernel Emulation [J] . Changwan Hong, Aravind Sukumaran-Rajam, Jinsung Kim, ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2018 ,第1期

机译：海报：使用抽象内核仿真GPU的性能建模
3. 1113 POSTER Brief instrument to identify information preference groups in cancer patients: a latent-class analysis [J] . M.Neumann, M.Wirtz, E.Bollschweiler, European Journal of Cancer Supplements . 2007 ,第4期

机译：1113 POSTER用于识别癌症患者信息偏好人群的简要工具：潜在类别分析
4. POSTER: GPUs Pipeline Latency Analysis [C] . Yehia Arafa, Abdel-Hameed A. Badawy, Gopinath Chennupati, International Conference on Application-specific Systems, Architectures and Processors . 2019

机译：海报：GPU管道延迟分析
5. Understanding Latency Hiding on GPUs. [D] . Volkov, Vasily. 2016

机译：了解GPU上的延迟隐藏。
6. Application Performance Analysis and Efficient Execution on Systems with multi-core CPUs GPUs and MICs: A Case Study with Microscopy Image Analysis [O] . George Teodoro, Tahsin Kurc, Guilherme Andrade, -1

机译：具有多核CPUGPU和MIC的系统上的应用程序性能分析和高效执行：以显微镜图像分析为例
7. Enabling a High Throughput Real Time Data Pipeline for a Large Radio Telescope Array with GPUs [O] . Edgar, R. G., Clark, M. A., Dale, K., 2010

机译：为大型无线电启用高吞吐量实时数据流水线带GpU的望远镜阵列

POSTER: GPUs Pipeline Latency Analysis

摘要

著录项

相似文献

相关主题

期刊订阅