Conference: Parallel and Distributed Computing and Networks
OPTIMISING DATA MOVEMENT RATES FOR PARALLEL PROCESSING APPLICATIONS ON GRAPHICS PROCESSORS

Abstract

Graphics processing units (GPUs) are playing an increasingly important role in highly parallelisable non-graphical applications. With the latest graphics cards boasting a theoretical 165 GFLOPS and 54 GB/s of memory bandwidth spread across 48 ALUs, it is easy to see why. The GPU architecture is particularly suited to the parallel stream processing paradigm of low data dependency, a high data-to-instruction ratio and predictable memory access patterns. One largely ignored, yet key, bottleneck for this type of processing on GPUs is transfer performance to and from the graphics card, in both the download and readback directions. Existing tools give developers great assistance in many areas of GPU application development, but very limited assistance in achieving the best bi-directional data transfer performance. In this paper, we discuss these limitations and present new investigative tools which allow developers of general-purpose GPU applications to explore the complex array of configuration states which affect both download and readback performance.