首页> 外文会议> >Fast Community Detection Algorithm with GPUs and Multicore Architectures
【24h】

Fast Community Detection Algorithm with GPUs and Multicore Architectures

机译:具有GPU和多核架构的快速社区检测算法

获取原文

摘要

In this paper, we present the design of a novel scalable parallel algorithm for community detection optimized for multi-core and GPU architectures. Our algorithm is based on label propagation, which works solely on local information, thus giving it the scalability advantage over conventional approaches. We also show that weighted label propagation can overcome typical quality issues in communities detected with label propagation. Experimental results on well known massive scale graphs such as Wikipedia (100M edges) and also on RMAT graphs with 10M - 40M edges, demonstrate the superior performance and scalability of our algorithm compared to the well known approaches for community detection. On the textit{hep-th} graph ($352$K edges) and the textit{wikipedia} graph ($100$M edges), using Power 6 architecture with $32$ cores, our algorithm achieves one to two orders of magnitude better performance compared to the best known prior results on parallel architectures with similar number of CPUs. Further, our GPGPU based algorithm achieves $8times$ improvement over the Power 6 performance on $40$M edge R-MAT graph. Alongside, we achieve high quality (modularity) of communities detected, with experimental evidence from well-known graphs such as Zachary karate club, Dolphin network and Football club, where we achieve modularity that is close to the best known alternatives. To the best of our knowledge these are best known results for community detection on massive graphs ($100$M edges) in terms of performance and also quality vs. performance trade-off. This is also a unique work on community detection on GPGPUs with scalable performance.
机译:在本文中,我们提出了一种针对社区检测的新型可扩展并行算法的设计,该算法针对多核和GPU架构进行了优化。我们的算法基于标签传播,它仅对本地信息起作用,因此具有比传统方法更高的可伸缩性优势。我们还显示,加权标签传播可以克服通过标签传播检测到的社区中的典型质量问题。在众所周知的大规模比例图(例如Wikipedia(100M边))上以及在具有10M-40M边的RMAT图上的实验结果证明,与众所周知的社区检测方法相比,我们的算法具有出色的性能和可伸缩性。在textit {hep-th}图($ 352 $ K边)和textit {wikipedia}图($ 100 $ M边)上,使用具有$ 32 $内核的Power 6架构,我们的算法与之相比,性能提高了一到两个数量级。在具有相似数量的CPU的并行体系结构上的最佳已知先验结果。此外,我们的基于GPGPU的算法在$ 40 $ M边缘R-MAT图表上比Power 6性能提高了$ 8倍。同时,我们还获得了检测到的社区的高质量(模块化),并获得了著名图表的实验证据,例如Zachary空手道俱乐部,海豚网络和Football俱乐部,在那里我们实现了与最知名替代方案接近的模块化。据我们所知,在性能,质量与性能之间的权衡方面,这是在大规模图形($ 100M边缘)上进行社区检测的最著名结果。这也是GPGPU上具有可扩展性能的社区检测方面的一项独特工作。

著录项

  • 来源
    《》|2011年|p.568-579|共12页
  • 会议地点
  • 作者

    Soman Jyothish; Narang Ankur;

  • 作者单位
  • 会议组织
  • 原文格式 PDF
  • 正文语种
  • 中图分类 TP311.133;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号