Analysis and Implementation of Particle-to-Particle (P2P) Graphics Processor Unit (GPU) Kernel for Black-Box Adaptive Fast Multipole Method.

Abstract

The Black-Box Adaptive Fast Multipole Method (bbAFMM) has attracted interest within the high-performance computing community as a tractable solution to the well-known n-body problem. The bbAFMM approximates the n-body solution using a series of independent functions, or kernels, that lend themselves to high-performance code development on one or more graphics processor unit (GPU) devices. This work presents the analysis and implementation of the direct-interaction kernel, called the particle-to-particle (P2P) kernel, for a single shared-memory GPU device using the Compute Unified Device Architecture (CUDA), revealing a speedup of more than 500 times over the corresponding serial central processing unit (CPU) implementation. The objective of this work is both to document the implementation of the GPU kernel and to provide a better understanding of the observed performance through an algorithmic analysis that focuses on arithmetic intensity, GPU memory bandwidth, GPU peak performance, and the defined Peripheral Component Interconnect Express (PCIe) bandwidth.
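
To make the direct-interaction idea concrete, the following is a minimal serial sketch in Python. It is not the report's CUDA kernel. It assumes a 1/r potential as the interaction function (the abstract does not name one), and the function names `p2p_direct` and `attainable_gflops` are hypothetical. The second function shows one common way to relate arithmetic intensity, memory bandwidth, and peak performance, namely a roofline-style bound. The abstract does not state which performance model the report uses, so this is an assumption.

```python
import math


def p2p_direct(targets, sources, charges):
    """Direct (all-pairs) sum: phi(x_i) = sum_j q_j / |x_i - y_j|.

    Assumes a 1/r kernel for illustration; coincident points are skipped
    so that a particle does not interact with itself.
    """
    potentials = []
    for tx, ty, tz in targets:
        phi = 0.0
        for (sx, sy, sz), q in zip(sources, charges):
            dx, dy, dz = tx - sx, ty - sy, tz - sz
            r2 = dx * dx + dy * dy + dz * dz
            if r2 > 0.0:  # skip self-interaction
                phi += q / math.sqrt(r2)
        potentials.append(phi)
    return potentials


def attainable_gflops(flops_per_byte, peak_gflops, bandwidth_gbs):
    """Roofline-style bound (an assumed model, not taken from the report):
    performance is limited by either peak compute or by
    arithmetic intensity times memory bandwidth."""
    return min(peak_gflops, flops_per_byte * bandwidth_gbs)
```

In this sketch each target point is independent of the others, which is the property that makes the direct sum a natural fit for one-thread-per-target parallelism on a GPU. The cost grows with the product of the target and source counts, so the ratio of arithmetic to memory traffic determines which term of the bound dominates.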
