【24h】

Evaluation of Splitting-Up Conjugate Gradient Method on GPUs

机译:GPU上的共轭共轭梯度法评估

获取原文

摘要

This paper describes the implementation of a preconditioned CG (Conjugate Gradient) method on GPUs and evaluates the performance compared with CPUs. Our CG method utilizes SP (Splitting-Up) preconditioner, which is suitable for parallel processing because other dimensions except for one dimension are independent. In order to enhance the memory bandwidth to the global memory of GPUs, our implementation utilizes a pseudo matrix transposition before and after a tridiagonal matrix solver, which results in coalesced memory accesses. In addition, the number of pseudo matrix transpositions can be reduced to only one by using a rotation configuration technique. By these techniques, the speedups of our approach can be enhanced by up to 102.2%.
机译:本文介绍了在GPU上实施预处理CG(共轭梯度)方法的过程,并评估了与CPU相比的性能。我们的CG方法利用SP(拆分)预处理器,由于一维以外的其他尺寸是独立的,因此它适合于并行处理。为了增加GPU全局内存的内存带宽,我们的实现利用了三对角矩阵求解器之前和之后的伪矩阵转置,这导致合并的内存访问。另外,通过使用旋转配置技术,伪矩阵转置的数量可以减少到仅一个。通过这些技术,我们的方法可以将速度提高多达102.2%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号