Journal: IEEE Transactions on Parallel and Distributed Systems
An Optimized FFT-Based Direct Poisson Solver on CUDA GPUs

Abstract

A highly multithreaded FFT-based direct Poisson solver that makes effective use of the capabilities of the current NVIDIA graphics processing units (GPUs) is presented. Our algorithms carefully manage the multiple layers of the memory hierarchy of the GPUs such that almost all the global memory accesses are coalesced into 128-byte device memory transactions, and all computations are carried out directly on the registers. A new strategy to interleave the FFT computation along each dimension with other computations is used to minimize the total number of accesses to the 3D grid. We illustrate the performance of our algorithms on the NVIDIA Tesla and Fermi architectures for a wide range of grid sizes, up to the largest size that can fit on the device memory ($512\times 512\times 512$ on the Tesla C1060/C2050 and $512\times 256\times 256$ on the GeForce GTX 280/480). We achieve up to 140 GFLOPS and a bandwidth of 70 GB/s on the Tesla C1060, and up to 375 GFLOPS with a bandwidth of 120 GB/s on the GTX 480. The performance of our algorithms is superior to what can be achieved using the CUDA FFT library in combination with well-known parallel algorithms for solving tridiagonal linear systems of equations.
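
The abstract contrasts the solver with a baseline that combines FFTs with tridiagonal solves. As a rough illustration of that general pattern only, the sketch below solves a small 2D discrete Poisson problem on the CPU in plain Python. It is not the authors' GPU implementation. The 2D setting, the boundary conditions (periodic in the first index, zero Dirichlet in the second), and the names fft, thomas, solve_poisson and apply_laplacian are all assumptions made for this example.

```python
import cmath
import math

def fft(a, inverse=False):
    """Recursive radix-2 FFT (unnormalized); len(a) must be a power of two."""
    n = len(a)
    if n == 1:
        return list(a)
    sign = 1 if inverse else -1
    even = fft(a[0::2], inverse)
    odd = fft(a[1::2], inverse)
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(sign * 2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out

def thomas(lower, diag, upper, rhs):
    """Thomas algorithm for a tridiagonal system with constant coefficients."""
    m = len(rhs)
    c = [0j] * m
    d = [0j] * m
    c[0] = upper / diag
    d[0] = rhs[0] / diag
    for j in range(1, m):  # forward elimination
        denom = diag - lower * c[j - 1]
        c[j] = upper / denom
        d[j] = (rhs[j] - lower * d[j - 1]) / denom
    x = [0j] * m
    x[-1] = d[-1]
    for j in range(m - 2, -1, -1):  # back substitution
        x[j] = d[j] - c[j] * x[j + 1]
    return x

def solve_poisson(f, hx, hy):
    """Solve the 5-point discrete Poisson equation L u = f.

    Assumed setup: f[i][j] is periodic in i (len(f) a power of two) and
    u is zero just outside the j range (Dirichlet)."""
    n, m = len(f), len(f[0])
    # Step 1: FFT along the periodic direction, one transform per j.
    fhat = [fft([f[i][j] for i in range(n)]) for j in range(m)]  # fhat[j][k]
    # Step 2: one independent tridiagonal solve per Fourier mode k.
    off = 1.0 / hy ** 2
    uhat = []
    for k in range(n):
        lam = (2.0 * math.cos(2.0 * math.pi * k / n) - 2.0) / hx ** 2
        rhs = [fhat[j][k] for j in range(m)]
        uhat.append(thomas(off, lam - 2.0 * off, off, rhs))  # uhat[k][j]
    # Step 3: inverse FFT back to physical space, normalizing by n.
    u = [[0.0] * m for _ in range(n)]
    for j in range(m):
        col = fft([uhat[k][j] for k in range(n)], inverse=True)
        for i in range(n):
            u[i][j] = col[i].real / n
    return u

def apply_laplacian(u, hx, hy):
    """Apply the same 5-point operator to u; used only to check the solver."""
    n, m = len(u), len(u[0])
    f = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            up = u[i][j + 1] if j + 1 < m else 0.0
            dn = u[i][j - 1] if j - 1 >= 0 else 0.0
            f[i][j] = ((u[(i + 1) % n][j] - 2 * u[i][j] + u[(i - 1) % n][j]) / hx ** 2
                       + (up - 2 * u[i][j] + dn) / hy ** 2)
    return f
```

One way to exercise the sketch is to pick any grid function u, compute f with apply_laplacian, and confirm that solve_poisson recovers u to rounding error. Each Fourier mode yields an independent tridiagonal system, which is what makes this family of methods attractive for highly multithreaded hardware.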