首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium >FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks
【24h】

FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks

机译:FUSEMM:用于图形嵌入和图形神经网络的统一SDDMM-SPMM内核

获取原文

摘要

We develop a fused matrix multiplication kernel that unifies sampled dense-dense matrix multiplication and sparsedense matrix multiplication under a single operation called FusedMM. By using user-defined functions, FusedMM can capture almost all computational patterns needed by popular graph embedding and GNN approaches.FusedMM is an order of magnitude faster than its equivalent kernels in Deep Graph Library. The superior performance of FusedMM comes from the low-level vectorized kernels, a suitable load balancing scheme and an efficient utilization of the memory bandwidth. FusedMM can tune its performance using a code generator and perform equally well on Intel, AMD and ARM processors. FusedMM speeds up an end-to-end graph embedding algorithm by up to $28 imes$ on different processors. The source code is available at https://github.com/HipGraph/FusedMM.
机译:我们开发了一个融合的矩阵乘法内核,在一个名为Fumbermm的单个操作下统一采样的密集矩阵乘法和Sparsedense矩阵乘法。 通过使用用户定义的函数,融合MM可以捕获几乎流行图嵌入和GNN方法所需的所有计算模式.FusedMM是比深图库中的等效内核快的数量级。 熔融液的卓越性能来自低级矢量化内核,合适的负载平衡方案和存储带宽的有效利用率。 FUSEMM可以使用代码发生器调整其性能,并在英特尔,AMD和ARM处理器上进行同样良好。 融合速度将端到端图嵌入算法上的端到端图嵌入算法高达28美元 times $上的不同处理器。 源代码可用于https://github.com/hipgraph/fustmm。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号