首页> 外文会议>2017 IEEE 24th International Conference on High Performance Computing >Thrust++: Extending Thrust Framework for Better Abstraction and Performance
【24h】

Thrust++: Extending Thrust Framework for Better Abstraction and Performance

机译:Thrust ++:扩展Thrust框架以获得更好的抽象和性能

获取原文
获取原文并翻译 | 示例

摘要

A good design abstraction framework for high performance computing should provide a higher level programming abstraction that strikes a balance between the abstraction and visibility over the hardware so that the software developer can write a portable software without having to understand the hardware nuances, yet exploit the compute power optimally. In this paper we have analyzed a popular design abstraction framework called "Thrust" from NVIDIA, and proposed an extension called Thrust++ that provides abstraction over the memory hierarchy of an NVIDIA GPU. Thrust++ allows developers to make efficient use of shared memory and overall, provides better control over the GPU memory hierarchy while writing applications in Thrust style for the CUDA backend. We have shown that when applications are written for the CUDA backend using Thrust++, they have minimal performance degradation when compared to their equivalent CUDA versions. Further, Thrust++ provides almost 4x speedup when compared to Thrust, for certain compute intensive kernels that repeatedly use the reduce operation.
机译:一个用于高性能计算的好的设计抽象框架应提供更高级别的编程抽象,在硬件的抽象和可见性之间取得平衡,以便软件开发人员可以编写便携式软件,而不必了解硬件的细微差别,但可以利用计算最佳功率。在本文中,我们分析了NVIDIA提供的流行的设计抽象框架“ Thrust”,并提出了一个名为Thrust ++的扩展,该扩展提供了NVIDIA GPU的内存层次结构的抽象。 Thrust ++允许开发人员有效地使用共享内存和整体内存,在为CUDA后端以Thrust样式编写应用程序时,可以更好地控制GPU内存层次结构。我们已经证明,当使用Thrust ++为CUDA后端编写应用程序时,与同等CUDA版本相比,它们的性能下降最小。此外,对于某些重复使用reduce操作的计算密集型内核,Thrust ++与Thrust相比提供了近4倍的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号