首页> 外文会议>ACM/IEEE International Symposium on Computer Architecture >Real-World Design and Evaluation of Compiler-Managed GPU Redundant Multithreading
【24h】

Real-World Design and Evaluation of Compiler-Managed GPU Redundant Multithreading

机译:编译器管理GPU冗余多线程的现实世界设计与评估

获取原文

摘要

Reliability for general purpose processing on the GPU (GPGPU) is becoming a weak link in the construction of reliable supercomputer systems. Because hardware protection is expensive to develop, requires dedicated on-chip resources, and is not portable across different architectures, the efficiency of software solutions such as redundant multithreading (RMT) must be explored. This paper presents a real-world design and evaluation of automatic software RMT on GPU hardware. We first describe a compiler pass that automatically converts GPGPU kernels into redundantly threaded versions. We then perform detailed power and performance evaluations of three RMT algorithms, each of which provides fault coverage to a set of structures in the GPU. Using real hardware, we show that compiler-managed software RMT has highly variable costs. We further analyze the individual costs of redundant work scheduling, redundant computation, and inter-thread communication, showing that no single component in general is responsible for high overheads across all applications; instead, certain workload properties tend to cause RMT to perform well or poorly. Finally, we demonstrate the benefit of architectural support for RMT with a specific example of fast, register-level thread communication.
机译:GPU上通用处理的可靠性(GPGPU)正在成为可靠的超级计算机系统构造中的薄弱环节。由于硬件保护要开发昂贵,需要专用的片上资源,并且不在不同架构上便携,必须探索冗余多线程(RMT)等软件解决方案的效率。本文介绍了GPU硬件上自动软件RMT的真实设计和评估。我们首先描述一个编译器通过,它会自动将GPGPU内核转换为冗余的线程版本。然后,我们执行三个RMT算法的详细功率和性能评估,每个功率和性能评估每个都为GPU中的一组结构提供故障覆盖。使用真实硬件,我们显示编译器管理的软件RMT具有高度可变成本。我们进一步分析了冗余工作调度,冗余计算和线程间通信的个别成本,显示没有单个组件通常是对所有应用的高开销负责;相反,某些工作负载属性往往会导致RMT执行良好或不佳。最后,我们展示了RMT的架构支持的益处,具有快速寄存器级线程通信的具体示例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号