We evaluate optimized parallel sparse matrix-vector operations for several representative application areas on widespread multicore-based cluster configurations. First, the single-socket baseline performance is analyzed and modeled with respect to basic architectural properties of standard multicore chips. Beyond the single node, the performance of parallel sparse matrix-vector operations is often limited by communication overhead. Starting from the observation that nonblocking MPI is not able to hide communication cost using standard MPI implementations, we demonstrate that explicit overlap of communication and computation can be achieved by using a dedicated communication thread, which may run on a virtual core. Moreover, we identify performance benefits of hybrid MPI/OpenMP programming due to improved load balancing, even without explicit communication overlap. We compare performance results for pure MPI, the widely used "vector-like" hybrid programming strategies, and explicit overlap on a modern multicore-based cluster and a Cray XE6 system.