International Conference on High Performance Computing & Simulation

Sparse matrix computations on clusters with GPGPUs



Abstract

Hybrid nodes containing GPUs are rapidly becoming the norm in parallel machines. We have conducted experiments on how to plug GPU-enabled computational kernels into PSBLAS, an MPI-based library specifically geared towards sparse matrix computations. In this paper, we present our findings on which strategies are most promising in the quest for the optimal compromise among raw performance, speedup, software maintainability, and extensibility. We consider several solutions for implementing the data exchange with the GPU, focusing on data access and transfer, and present an experimental evaluation on a cluster system with up to two GPUs per node. In particular, we compare the pinned memory and the OpenMPI approaches, which are the two most widely used alternatives for multi-GPU communication in a cluster environment. We find that OpenMPI turns out to be the best solution for large data transfers, while the pinned memory approach remains a good solution for small transfers between GPUs.
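As an illustrative sketch only (not the paper's actual code), the two inter-node transfer strategies compared in the abstract can be contrasted as below. The function names are hypothetical; the second path assumes an Open MPI build with CUDA-aware support, which lets a device pointer be passed directly to MPI:

```c
#include <mpi.h>
#include <cuda_runtime.h>

/* Strategy 1: stage through page-locked (pinned) host memory.
   The device buffer is copied into a pinned host buffer, which MPI
   then sends. Pinned pages make the device-to-host copy fast and
   DMA-friendly; per the abstract, this works well for small messages. */
void send_via_pinned(const double *d_buf, int n, int dest, MPI_Comm comm)
{
    double *h_buf;
    cudaMallocHost((void **)&h_buf, n * sizeof(double));   /* pinned allocation */
    cudaMemcpy(h_buf, d_buf, n * sizeof(double), cudaMemcpyDeviceToHost);
    MPI_Send(h_buf, n, MPI_DOUBLE, dest, 0, comm);
    cudaFreeHost(h_buf);
}

/* Strategy 2: CUDA-aware MPI (e.g. Open MPI built with CUDA support).
   The device pointer goes straight to MPI, which performs the staging
   (or GPUDirect transfer) internally; per the abstract, this wins for
   large messages. */
void send_via_cuda_aware_mpi(const double *d_buf, int n, int dest, MPI_Comm comm)
{
    MPI_Send((void *)d_buf, n, MPI_DOUBLE, dest, 0, comm);
}
```

A halo-exchange layer could pick between the two per message size, which is one way to realize the compromise the abstract describes.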
