首页> 外文会议> >Fast Community Detection Algorithm with GPUs and Multicore Architectures

【24h】

Fast Community Detection Algorithm with GPUs and Multicore Architectures

机译：具有GPU和多核架构的快速社区检测算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present the design of a novel scalable parallel algorithm for community detection optimized for multi-core and GPU architectures. Our algorithm is based on label propagation, which works solely on local information, thus giving it the scalability advantage over conventional approaches. We also show that weighted label propagation can overcome typical quality issues in communities detected with label propagation. Experimental results on well known massive scale graphs such as Wikipedia (100M edges) and also on RMAT graphs with 10M - 40M edges, demonstrate the superior performance and scalability of our algorithm compared to the well known approaches for community detection. On the textit{hep-th} graph ($352$K edges) and the textit{wikipedia} graph ($100$M edges), using Power 6 architecture with $32$ cores, our algorithm achieves one to two orders of magnitude better performance compared to the best known prior results on parallel architectures with similar number of CPUs. Further, our GPGPU based algorithm achieves $8times$ improvement over the Power 6 performance on $40$M edge R-MAT graph. Alongside, we achieve high quality (modularity) of communities detected, with experimental evidence from well-known graphs such as Zachary karate club, Dolphin network and Football club, where we achieve modularity that is close to the best known alternatives. To the best of our knowledge these are best known results for community detection on massive graphs ($100$M edges) in terms of performance and also quality vs. performance trade-off. This is also a unique work on community detection on GPGPUs with scalable performance.

机译：在本文中，我们提出了一种针对社区检测的新型可扩展并行算法的设计，该算法针对多核和GPU架构进行了优化。我们的算法基于标签传播，它仅对本地信息起作用，因此具有比传统方法更高的可伸缩性优势。我们还显示，加权标签传播可以克服通过标签传播检测到的社区中的典型质量问题。在众所周知的大规模比例图（例如Wikipedia（100M边））上以及在具有10M-40M边的RMAT图上的实验结果证明，与众所周知的社区检测方法相比，我们的算法具有出色的性能和可伸缩性。在textit {hep-th}图（$ 352 $ K边）和textit {wikipedia}图（$ 100 $ M边）上，使用具有$ 32 $内核的Power 6架构，我们的算法与之相比，性能提高了一到两个数量级。在具有相似数量的CPU的并行体系结构上的最佳已知先验结果。此外，我们的基于GPGPU的算法在$ 40 $ M边缘R-MAT图表上比Power 6性能提高了$ 8倍。同时，我们还获得了检测到的社区的高质量（模块化），并获得了著名图表的实验证据，例如Zachary空手道俱乐部，海豚网络和Football俱乐部，在那里我们实现了与最知名替代方案接近的模块化。据我们所知，在性能，质量与性能之间的权衡方面，这是在大规模图形（$ 100M边缘）上进行社区检测的最著名结果。这也是GPGPU上具有可扩展性能的社区检测方面的一项独特工作。

著录项

来源
《》|2011年|p.568-579|共12页
会议地点
作者
Soman Jyothish; Narang Ankur;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.133;
关键词

相似文献

外文文献
中文文献
专利

1. Guest Editorial for the Special Section on BAlgorithm Vs. Architectures: Opportunities and Challenges in Multicore/GPU DSP [J] . Gwo Giun Chris Lee, Scott C. Kim Journal of signal processing systems for signal, image, and video technology . 2017,第3期

机译：BAlgorithm VS特别部分的客座社论。架构：多核/ GPU DSP中的机遇与挑战
2. SkyAlign: a portable, work-efficient skyline algorithm for multicore and GPU architectures [J] . Bogh Kenneth S., Chester Sean, Assent Ira The VLDB journal . 2016,第6期

机译：SkyAlign：适用于多核和GPU架构的便携式，高效工作的天际线算法
3. A hybrid parallel Barnes-Hut algorithm for GPU and multicore architectures [J] . Adrian Ozsoy Computing reviews . 2014,第4期

机译：适用于GPU和多核架构的混合并行Barnes-Hut算法
4. Fast Community Detection Algorithm with GPUs and Multicore Architectures [C] . Soman Jyothish, Narang Ankur IEEE International Parallel Distributed Processing Symposium . 2011

机译：GPU和多核架构快速群落检测算法
5. Fast Algorithms and Solvers in Computational Electromagnetics and Micromagnetics on GPUs. [D] . Li, Shaojing. 2012

机译：GPU上的计算电磁学和微电磁学的快速算法和求解器。
6. Multicore and GPU algorithms for Nussinov RNA folding [O] . Junjie Li, Sanjay Ranka, Sartaj Sahni 2014

机译：Nussinov RNA折叠的多核和GPU算法
7. A Class of Hybrid LAPACK Algorithms for Multicore and GPU Architectures [O] . Mitch Horton, Stanimire Tomov, Jack Dongarra 2012

机译：一类用于多核和GPU架构的混合LAPACK算法

Fast Community Detection Algorithm with GPUs and Multicore Architectures

摘要

著录项

相似文献

相关主题

期刊订阅