首页> 外文期刊>SIGKDD explorations >d2k: Scalable Community Detection in Massive Networks via Small-Diameter k-Plexes
【24h】

d2k: Scalable Community Detection in Massive Networks via Small-Diameter k-Plexes

机译:D2K:通过小直径k-plex的大规模网络中的可扩展社区检测

获取原文
获取原文并翻译 | 示例
           

摘要

This paper studies k-plexes, a well known pseudo-clique model for network communities. In a k-plex, each node can miss at most k ? 1 links. Our goal is to detect large communities in today's real-world graphs which can have hundreds of millions of edges. While many have tried, this task has been elusive so far due to its computationally challenging nature: k-plexes and other pseudo-cliques are harder to find and more numerous than cliques, a well known hard problem. We present d2k, which is the first algorithm able to find large k-plexes of very large graphs in just a few minutes. The good performance of our algorithm follows from a combination of graph-theoretical concepts, careful algorithm engineering and a high-performance implementation. In particular, we exploit the low degeneracy of real-world graphs, and the fact that large enough k-plexes have diameter 2. We validate a sequential and a parallel/distributed implementation of d2k on real graphs with up to half a billion edges.
机译:本文研究了K-PLEXES,是网络社区的众所周知的伪集团模型。 在k-plex中,每个节点都可能错过k? 1个链接。 我们的目标是在今天的真实图中检测大型社区,这可以有数亿边缘。 虽然许多人已经尝试过,但到目前为止,这项任务是由于其计算挑战性的性质:K-Plexes和其他伪派系更难以找到,并且比派系更众多,众所周知的难题。 我们呈现D2K,这是一个能够在几分钟内找到大型k-plex的算法。 我们的算法的良好表现从图形理论概念,仔细算法工程和高性能实现的组合之后。 特别是,我们利用现实世界图的低退化性,并且足够大的K-Plexes具有直径2的事实。我们验证了在最多有5亿边缘的真实图表上的顺序和并行/分布式的D2K。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号