首页> 外文会议>Applied parallel and scientific computing.;part 1. >CUDA 2D Stencil Computations for the Jacobi Method
【24h】

CUDA 2D Stencil Computations for the Jacobi Method

机译:用于Jacobi方法的CUDA 2D模板计算

获取原文
获取原文并翻译 | 示例

摘要

We are witnessing the consolidation of the CPUs streaming paradigm in parallel computing. This paper explores stencil operations in CUDA to optimize on CPUs the Jacobi method for solving Laplace's differential equation. The code keeps constant the access pattern through a large number of loop iterations, that way being representative of a wide set of iterative linear algebra algorithms. Optimizations are focused on data parallelism, threads deployment and the GPU memory hierarchy, whose management is explicit by the CUDA programmer. Experimental results are shown on Nvidia Teslas C870 and C1060 CPUs and compared to a counterpart version optimized on a quadcore Intel CPU. The speedup factor for our set of GPU optimizations reaches 3-4x and the execution times defeat those of the CPU by a wide margin, also showing great scalability when moving towards a more sophisticated GPU architecture and/or more demanding problem sizes.
机译:我们正在目睹并行计算中CPU流模式的整合。本文探讨了CUDA中的模版操作,以在CPU上优化Jacobi方法来求解Laplace微分方程。该代码通过大量的循环迭代来保持访问模式不变,从而代表了广泛的迭代线性代数算法。优化的重点是数据并行性,线程部署和GPU内存层次结构,CUDA程序员对其进行明确管理。实验结果显示在Nvidia Teslas C870和C1060 CPU上,并与在四核Intel CPU上优化的对应版本进行了比较。我们的一组GPU优化的加速因子达到了3-4倍,执行时间大大击败了CPU,在朝着更复杂的GPU架构和/或更苛刻的问题规模发展时,也显​​示出巨大的可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号