首页> 外文会议>Proceedings of 9th Workshop on Latest Advances in Scalable Algorithms for LargeScale Systems >Low Thread-Count Gustavson: A Multithreaded Algorithm for Sparse Matrix-Matrix Multiplication Using Perfect Hashing
【24h】

Low Thread-Count Gustavson: A Multithreaded Algorithm for Sparse Matrix-Matrix Multiplication Using Perfect Hashing

机译:低线程数Gustavson:使用完美哈希的稀疏矩阵-矩阵乘法的多线程算法

获取原文
获取原文并翻译 | 示例

摘要

Sparse matrix-matrix multiplication is a critical kernel for several scientific computing applications, especially the setup phase of algebraic multigrid. The MPI+X programming model, which is growing in popularity, requires that such kernels be implemented in a way that exploits on-node parallelism. We present a single-pass OpenMP variant of Gustavson's sparse matrix matrix multiplication algorithm designed for architectures (e.g. CPU or Intel Xeon Phi) with reasonably large memory and modest thread counts (tens of threads, not thousands). These assumptions allow us to exploit perfect hashing and dynamic memory allocation to achieve performance improvements of up to 2× over third-party kernels for matrices derived from algebraic multigrid setup.
机译:稀疏矩阵矩阵乘法是一些科学计算应用程序(尤其是代数多重网格的设置阶段)的关键内核。 MPI + X编程模型越来越流行,它要求以利用节点上并行性的方式实现这种内核。我们介绍了Gustavson的稀疏矩阵矩阵乘法算法的单遍OpenMP变体,该算法专为架构(例如CPU或Intel Xeon Phi)设计,具有相当大的内存和适度的线程数(数十个线程,而不是数千个)。这些假设使我们能够利用完美的散列和动态内存分配,来实现对代数多重网格设置派生的矩阵的性能比第三方内核提高2倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号