首页> 外文会议>IEEE/ACM international symposium on cluster, cloud and grid computing >An Evaluation of Unified Memory Technology on NVIDIA GPUs
【24h】

An Evaluation of Unified Memory Technology on NVIDIA GPUs

机译:NVIDIA GPU上的统一内存技术评估

获取原文

摘要

Unified Memory is an emerging technology which is supported by CUDA 6.X. Before CUDA 6.X, the existing CUDA programming model relies on programmers to explicitly manage data between CPU and GPU and hence increases programming complexity. CUDA 6.X provides a new technology which is called as Unified Memory to provide a new programming model that defines CPU and GPU memory space as a single coherent memory (imaging as a same common address space). The system manages data access between CPU and GPU without explicit memory copy functions. This paper is to evaluate the Unified Memory technology through different applications on different GPUs to show the users how to use the Unified Memory technology of CUDA 6.X efficiently. The applications include Diffusion3D Benchmark, Parboil Benchmark Suite, and Matrix Multiplication from the CUDA SDK Samples. We changed those applications to corresponding Unified Memory versions and compare those with the original ones. We selected the NVIDIA Keller K40 and the Jetson TK1, which can represent the latest GPUs with Keller architecture and the first mobile platform of NVIDIA series with Keller GPU. This paper shows that Unified Memory versions cause 10% performance loss on average. Furthermore, we used the NVIDIA Visual Profiler to dig the reason of the performance loss by the Unified Memory technology.
机译:统一内存是CUDA 6.X支持的一项新兴技术。在CUDA 6.X之前,现有的CUDA编程模型依靠程序员来显式管理CPU和GPU之间的数据,从而增加了编程复杂性。 CUDA 6.X提供了一种称为统一内存的新技术,它提供了一种新的编程模型,该模型将CPU和GPU内存空间定义为单个一致性内存(映像为相同的公共地址空间)。该系统无需显式的内存复制功能即可管理CPU和GPU之间的数据访问。本文旨在通过不同GPU上的不同应用程序对统一内存技术进行评估,以向用户展示如何有效地使用CUDA 6.X的统一内存技术。这些应用程序包括CUDA SDK样本中的Diffusion3D基准,Parboil基准套件和矩阵乘法。我们将这些应用程序更改为相应的统一内存版本,并将其与原始版本进行比较。我们选择了NVIDIA Keller K40和Jetson TK1,它们可以代表具有Keller架构的最新GPU和具有Keller GPU的NVIDIA系列的第一个移动平台。本文显示统一内存版本平均会导致10%的性能损失。此外,我们使用NVIDIA Visual Profiler来找出统一内存技术导致性能下降的原因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号