首页> 外文会议>International conference on high performance computing for computational science >A Study of SpMV Implementation Using MPI and OpenMP on Intel Many-Core Architecture
【24h】

A Study of SpMV Implementation Using MPI and OpenMP on Intel Many-Core Architecture

机译:使用MPI和OpenMP在英特尔多核体系结构上实现SpMV的研究

获取原文

摘要

The Sparse Matrix-Vector Multiplication (SpMV) is fundamental to a broad spectrum of scientific and engineering applications, such as many iterative numerical methods. The widely used Compressed Sparse Row (CSR) sparse matrix storage format was chosen to carry on this study for sustainability and reusability reasons. We parallelized for Intel Many Integrated Core (MIC) architecture a vectorized SpMV kernel using MPI and OpenMP, both pure and hybrid versions of them. In comparison to pure models and vendor-supplied BLAS libraries across different mainstream architectures (CPU, GPU), the hybrid model exhibits a substantial improvement. To further assess the behavior of hybrid model, we attribute the inadequacy of performances to vectorization rate, irregularity of non-zeros, and load balancing issue. A mathematical relationship between the first two factors and the performance is then proposed based on the experimental data.
机译:稀疏矩阵向量乘法(SpMV)是广泛的科学和工程应用(例如许多迭代数值方法)的基础。出于可持续性和可重用性的原因,选择了广泛使用的压缩稀疏行(CSR)稀疏矩阵存储格式来进行此研究。我们使用MPI和OpenMP(它们的纯版本和混合版本)为Intel多集成核心(MIC)架构并行化了矢量化SpMV内核。与跨不同主流体系结构(CPU,GPU)的纯模型和供应商提供的BLAS库相比,混合模型表现出了很大的改进。为了进一步评估混合模型的行为,我们将性能的不足归因于矢量化率,非零的不规则性以及负载平衡问题。然后根据实验数据提出了前两个因素与性能之间的数学关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号