首页> 外文会议>Conference on Computing frontiers >Memory efficient parallel matrix multiplication operation for irregular problems
【24h】

Memory efficient parallel matrix multiplication operation for irregular problems

机译:内存有效的并行矩阵乘法运算可解决不规则问题

获取原文

摘要

Regular distributions for storing dense matrices on parallel systems are not always used in practice. In many scientific applicati RUMMA) [1] to handle irregularly distributed matrices. Our approach relies on a distribution independent algorithm that provides dynamic load balancing by exploiting data locality and achieves performance as good as the traditional approach which relies on temporary arrays with regular distribution, data redistribution, and matrix multiplication for regular matrices to handle the irregular case. The proposed algorithm is memory-efficient because temporary matrices are not needed. This feature is critical for systems like the IBM Blue Gene/L that offer very limited amount of memory per node. The experimental results demonstrate very good performance across the range of matrix distributions and problem sizes motivated by real applications.
机译:在实践中并不总是使用用于在并行系统上存储密集矩阵的规则分布。在许多科学应用中,RUMMA)[1]处理不规则分布的矩阵。我们的方法依赖于一种独立于分布的算法,该算法通过利用数据局部性来提供动态负载平衡,并获得了与传统方法相同的性能,传统方法依赖于具有规则分布的临时数组,数据重新分布以及规则矩阵的矩阵乘法来处理不规则情况。由于不需要临时矩阵,因此所提出的算法具有存储效率。对于像IBM Blue Gene / L这样的每个节点提供非常有限的内存量的系统而言,此功能至关重要。实验结果表明,在由实际应用引起的矩阵分布和问题大小范围内,它们都具有非常好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号