...
首页> 外文期刊>Parallel Computing >Locality-aware parallel block-sparse matrix-matrix multiplication using the Chunks and Tasks programming model
【24h】

Locality-aware parallel block-sparse matrix-matrix multiplication using the Chunks and Tasks programming model

机译:使用块和任务编程模型的局部性并行块稀疏矩阵矩阵乘法

获取原文
获取原文并翻译 | 示例

摘要

We present a method for parallel block-sparse matrix-matrix multiplication on distributed memory clusters. By using a quadtree matrix representation, data locality is exploited without prior information about the matrix sparsity pattern. A distributed quadtree matrix representation is straightforward to implement due to our recent development of the Chunks and Tasks programming model [Parallel Comput. 40, 328 (2014)]. The quadtree representation combined with the Chunks and Tasks model leads to favorable weak and strong scaling of the communication cost with the number of processes, as shown both theoretically and in numerical experiments.
机译:我们提出了一种在分布式存储集群上并行块稀疏矩阵矩阵乘法的方法。通过使用四叉树矩阵表示,可以在没有有关矩阵稀疏模式的先验信息的情况下利用数据局部性。分布式四叉树矩阵表示很容易实现,这是由于我们最近开发的块和任务编程模型[并行计算。 40,328(2014)]。四叉树表示法与Chunks and Tasks模型相结合,导致通信成本随过程数量而变弱和变强,如理论和数值实验所示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号