首页> 外文会议>High performance computing symposium 2011. >Parallel GMRES implementation for solving sparse linear systems on GPU clusters
【24h】

Parallel GMRES implementation for solving sparse linear systems on GPU clusters

机译:并行GMRES实现,用于解决GPU集群上的稀疏线性系统

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose an efficient parallel implementation of the GMRES method for GPU clusters. This implementation requires us to parallelize the GMRES algorithm between the CPUs of the cluster. Hence, all parallel and intensive computations on local data are performed on GPUs and reduction operations to compute global results are carried out by CPUs. The performances of our parallel GMRES solver are evaluated on test matrices of sizes exceeding 107 rows. They show that solving large and sparse linear systems on a GPU cluster is faster than those performed on its CPU counterpart. It is noticed that a cluster of 12 GPUs is about 8 times faster than a cluster of 12 CPUs and about 5 times faster than a cluster of 24 CPUs.
机译:在本文中,我们提出了针对GPU集群的GMRES方法的高效并行实现。此实现要求我们在群集的CPU之间并行化GMRES算法。因此,对本地数据的所有并行和密集计算都在GPU上执行,而用于计算全局结果的归约运算则由CPU执行。我们的并行GMRES求解器的性能在尺寸超过107行的测试矩阵上进行了评估。他们表明,在GPU集群上求解大型稀疏线性系统比在其CPU上执行的系统更快。值得注意的是,由12个GPU组成的集群比由12个CPU组成的集群快约8倍,比由24个CPU组成的集群快约5倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号