Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units

Karl Rupp; Josef Weinbub; Ansgar Jüngel; Tibor Grasser


Abstract

We revisit the implementation of iterative solvers on discrete graphics processing units and demonstrate the benefit of implementations using extensive kernel fusion for pipelined formulations over conventional implementations of classical formulations. The proposed implementations with both CUDA and OpenCL are freely available in ViennaCL and are shown to be competitive with or even superior to other solver packages for graphics processing units. The highest performance gains are obtained for small to medium-sized systems, while our implementations are on par with vendor-tuned implementations for very large systems. Our results are especially beneficial for transient problems, where many small to medium-sized systems instead of a single big system need to be solved.
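The abstract does not say which operations are fused, so the following is only a conceptual sketch of the general idea of kernel fusion, not the ViennaCL implementation. It assumes a conjugate-gradient-style residual update followed by a norm computation, and uses plain Python loops as stand-ins for GPU kernels. All function names and the `stats` launch counter are hypothetical.

```python
# Conceptual sketch only: plain Python loops stand in for GPU kernels.
# Function names and the `stats` counter are hypothetical, not ViennaCL API.

stats = {"launches": 0}  # counts simulated kernel launches


def axpy_kernel(alpha, x, y):
    """y <- y + alpha * x, one full pass over the vectors."""
    stats["launches"] += 1
    for i in range(len(y)):
        y[i] += alpha * x[i]


def dot_kernel(x, y):
    """Return <x, y>, a second full pass over the vectors."""
    stats["launches"] += 1
    return sum(a * b for a, b in zip(x, y))


def residual_step_unfused(alpha, Ap, r):
    """Update r <- r - alpha * Ap, then compute <r, r> with two kernels."""
    axpy_kernel(-alpha, Ap, r)
    return dot_kernel(r, r)


def residual_step_fused(alpha, Ap, r):
    """Update r <- r - alpha * Ap and accumulate <r, r> in a single pass."""
    stats["launches"] += 1
    rr = 0.0
    for i in range(len(r)):
        r[i] -= alpha * Ap[i]
        rr += r[i] * r[i]  # reuse the freshly computed entry
    return rr
```

Both variants produce the same residual and norm. The fused one reads and writes each vector entry once and needs a single launch, which is the kind of saving that plausibly matters most when the vectors are small and launch overhead dominates.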


Bibliographic record

Source: ACM Transactions on Mathematical Software, 2017, issue 2, pp. 11.1-11.27 (27 pages)
Authors: Karl Rupp; Josef Weinbub; Ansgar Jüngel; Tibor Grasser
Affiliation: TU Wien, Wien, Austria
Original format: PDF
Language: English
Keywords: BiCGStab method; CUDA; GMRES method; GPU; Iterative solvers; OpenCL; conjugate gradient method

