Scaling sparse matrix-matrix multiplication in the accumulo database

Demirci Gunduz Vehbi; Aykanat Cevdet

首页> 外文期刊>Distributed and Parallel Databases >Scaling sparse matrix-matrix multiplication in the accumulo database

【24h】

Scaling sparse matrix-matrix multiplication in the accumulo database

机译：在累积数据库中缩放稀疏矩阵-矩阵乘法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We propose and implement a sparse matrix-matrix multiplication (SpGEMM) algorithm running on top of Accumulo's iterator framework which enables high performance distributed parallelism. The proposed algorithm provides write-locality while ingesting the output matrix back to database via utilizing row-by-row parallel SpGEMM. The proposed solution also alleviates scanning of input matrices multiple times by making use of Accumulo's batch scanning capability which is used for accessing multiple ranges of key-value pairs in parallel. Even though the use of batch-scanning introduces some latency overheads, these overheads are alleviated by the proposed solution and by using node-level parallelism structures. We also propose a matrix partitioning scheme which reduces the total communication volume and provides a balance of workload among servers. The results of extensive experiments performed on both real-world and synthetic sparse matrices show that the proposed algorithm scales significantly better than the outer-product parallel SpGEMM algorithm available in the Graphulo library. By applying the proposed matrix partitioning, the performance of the proposed algorithm is further improved considerably.

机译：我们提出并实现了一种在Accumulo迭代器框架之上运行的稀疏矩阵-矩阵乘法（SpGEMM）算法，该算法可实现高性能的分布式并行性。所提出的算法在提供写入局部性的同时，利用逐行并行SpGEMM将输出矩阵提取回数据库。所提出的解决方案还利用Accumulo的批量扫描功能减轻了输入矩阵的多次扫描，该功能用于并行访问多个范围的键-值对。即使使用批处理扫描会带来一些延迟开销，但这些提议的解决方案和使用节点级并行性结构都会减轻这些开销。我们还提出了一种矩阵分区方案，该方案可减少总通信量并提供服务器之间的工作负载平衡。在现实世界和合成稀疏矩阵上进行的大量实验的结果表明，与Graphulo库中可用的外部产品并行SpGEMM算法相比，所提出的算法可扩展性更好。通过应用所提出的矩阵划分，所提出的算法的性能被大大改善。

著录项

来源
《Distributed and Parallel Databases》 |2020年第1期|31-62|共32页
作者
Demirci Gunduz Vehbi; Aykanat Cevdet;
展开▼
作者单位

Bilkent Univ Dept Comp Engn Ankara Turkey;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Databases; NoSQL; Accumulo; Graphulo; Parallel and distributed computing; Sparse matrices; Sparse matrix-matrix multiplication; SpGEMM; Matrix partitioning; Graph partitioning; Data locality;

机译：数据库;NoSQL;Accumulo;Graphulo;并行和分布式计算;稀疏矩阵;稀疏矩阵矩阵乘法;SpGEMM;矩阵分区;图分区;数据局部性;

相似文献

外文文献
中文文献
专利

1. Scaling sparse matrix-matrix multiplication in the accumulo database [J] . Demirci Gunduz Vehbi, Aykanat Cevdet Ecological restoration . 2020,第1期

机译：缩放稀疏矩阵矩阵乘法在累计数据库中
2. Parallel sparse matrix-matrix multiplication: a scalable solution with 1D algorithm [J] . Mohammad Asadul Hoque, Rezaul Karim Raju, Christopher John Tymczak, International Journal of Computational Science and Engineering . 2015,第4期

机译：并行稀疏矩阵-矩阵乘法：具有一维算法的可扩展解决方案
3. Performance-Aware Model for Sparse Matrix-Matrix Multiplication on the Sunway TaihuLight Supercomputer [J] . Chen Yuedan, Li Kenli, Yang Wangdong, IEEE Transactions on Parallel and Distributed Systems . 2019,第4期

机译：Sunway TaihuLight超级计算机上稀疏矩阵-矩阵乘法的性能感知模型
4. Communication-Avoiding and Memory-Constrained Sparse Matrix-Matrix Multiplication at Extreme Scale [C] . Md Taufique Hussain, Oguz Selvitopi, Aydin Buluç, IEEE International Parallel and Distributed Processing Symposium . 2021

机译：避免避免和内存受限的稀疏矩阵矩阵乘法
5. Efficient, scalable, parallel, matrix-matrix multiplication [D] . Portillo, Enrique 2013

机译：高效，可扩展，并行，矩阵矩阵乘法
6. A unified framework for sparse non-negative least squares using multiplicative updates and the non-negative matrix factorization problem [O] . Igor Fedorov, Alican Nalci, Ritwik Giri, -1

机译：使用乘法更新和非负矩阵分解问题的稀疏非负最小二乘的统一框架
7. Communication-Avoiding and Memory-Constrained Sparse Matrix-Matrix Multiplication at Extreme Scale [O] . Md Taufique Hussain, Oguz Selvitopi, Aydin Buluc, 2021

机译：避免通信和内存受限稀疏矩阵矩阵矩阵

Scaling sparse matrix-matrix multiplication in the accumulo database

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅