首页> 外文期刊>Journal of algorithms & computational technology >CUDA Memory Optimizations for Large Data-Structures in the Gravit Simulator
【24h】

CUDA Memory Optimizations for Large Data-Structures in the Gravit Simulator

机译:Gravit Simulator中针对大型数据结构的CUDA内存优化

获取原文
获取原文并翻译 | 示例
           

摘要

Modern GPUs open a completely new field to optimize embarrassingly parallel algorithms. Implementing an algorithm on a GPU confronts the programmer with a new set of challenges for program optimization. Especially tuning the program for the GPU memory hierarchy whose organization and performance implications are radically different from those of general purpose CPUs; and optimizing programs at the instruction-level for the GPU. In this paper we analyze different approaches for optimizing the memory usage and access patterns for GPUs and propose a class of memory layout optimizations that can take full advantage of the unique memory hierarchy of NVIDIA CUDA. Furthermore, we analyze some classical optimization techniques and how they effect the performance on a GPU. We used the Gravit gravity simulator to demonstrate these optimizations. The final optimized GPU version achieves a 87x speedup compared to the original CPU version. Almost 30% of this speedup are direct results of the optimizations discussed in this paper.
机译:现代GPU开辟了一个全新的领域,以优化令人尴尬的并行算法。在GPU上实现算法使程序员面临着程序优化方面的新挑战。特别是针对GPU内存层次结构调整程序,其组织和性能影响与通用CPU完全不同。在GPU的指令级上优化程序。在本文中,我们分析了用于优化GPU的内存使用和访问模式的不同方法,并提出了一类可以充分利用NVIDIA CUDA独特内存层次结构的内存布局优化。此外,我们分析了一些经典的优化技术以及它们如何影响GPU的性能。我们使用Gravit重力模拟器来演示这些优化。最终优化的GPU版本与原始CPU版本相比可实现87倍的加速。这种加速的几乎30%是本文讨论的优化的直接结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号