首页> 外文会议>International Conference on Advances in Social Networks Analysis and Mining >A Distributed Algorithm for Community Detection in Large Graphs
【24h】

A Distributed Algorithm for Community Detection in Large Graphs

机译:大图中社区检测的分布式算法

获取原文

摘要

Networks in various application domains present an internal structure, where nodes form groups of tightly connected components which are more loosely connected to the rest of the network. Several attempts have been made to provide a formal definition to the generally described "community finding" concept, providing different approaches. Some algorithms follow an iterative approach starting by characterizing either the entire network, or each individual node as community, and splitting [1] or merging communities respectively, producing a hierarchical tree of nested communities, called dendrogram. Several researchers aim to find the entire hierarchical community dendrogram [1] while others wish to identify only the optimal community partition. Some researchers aim at discovering distinct (non-overlapping) communities, while others allow for overlaps [2]. The Blondel algorithm described by Blondel et al. in [3], follows a bottom-up approach. Each node in the graph comprises a singleton community. Two communities are merged into one if the resulting community has larger modularity value that both the initial ones. This is a fast and accurate algorithm which, detects all communities in the graph. In suffers however in the sense that it constantly, during its execution, requires the knowledge of a global information of the graph, namely the number of its edges (which change as the algorithm modifies the graph), limiting its distributed nature. The Infomap algorithm [4] transforms the problem of community detection into efficiently compressing the structure of the graph, so that one can recovered almost the entire structure from the compressed form. This is achieved by minimizing a function that expresses the tradeoff between compression factor and loss of information (difference between the original graph and the reconstructed graph).
机译:各种应用域中的网络呈现内部结构,其中节点形成了更紧密地连接到网络的紧密连接的组件组。已经提出了几次尝试为通常描述的“社区发现”概念提供了一个正式的定义,提供了不同的方法。一些算法遵循通过将整个网络或每个单个节点作为社区的特征来开始的迭代方法,以及分别拆分[1]或合并社区,从而生成嵌套社区的分层树,称为Dendrogram。几位研究人员旨在找到整个分层社区树木[1],而其他研究则希望仅识别最佳的社区分区。一些研究人员旨在发现不同(非重叠)社区,而其他研究则允许重叠[2]。 Blondel等人描述的Blondel算法。在[3]中,遵循自下而上的方法。图中的每个节点包括单例社区。如果生成的社区具有较大的初始值,则两个社区被合并为一个。这是一种快速准确的算法,可检测图中的所有社区。然而,在遭受的情况下,它在执行期间,在执行期间,需要了解图的全局信息,即其边的数量(随着算法修改图形的变化),限制了其分布性质。 Infomap算法[4]将社区检测问题进行有效压缩图的结构,从而可以从压缩形式恢复几乎整个结构。这是通过最小化表示压缩因子与信息丢失之间的权衡的函数来实现的(原始图和重建图之间的差异)来实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号