首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >On the Spectral Characterization and Scalable Mining of Network Communities
【24h】

On the Spectral Characterization and Scalable Mining of Network Communities

机译:网络社区的频谱表征与可扩展挖掘

获取原文
获取原文并翻译 | 示例

摘要

Network communities refer to groups of vertices within which their connecting links are dense but between which they are sparse. A network community mining problem (or NCMP for short) is concerned with the problem of finding all such communities from a given network. A wide variety of applications can be formulated as NCMPs, ranging from social and/or biological network analysis to web mining and searching. So far, many algorithms addressing NCMPs have been developed and most of them fall into the categories of either optimization based or heuristic methods. Distinct from the existing studies, the work presented in this paper explores the notion of network communities and their properties based on the dynamics of a stochastic model naturally introduced. In the paper, a relationship between the hierarchical community structure of a network and the local mixing properties of such a stochastic model has been established with the large-deviation theory. Topological information regarding to the community structures hidden in networks can be inferred from their spectral signatures. Based on the above-mentioned relationship, this work proposes a general framework for characterizing, analyzing, and mining network communities. Utilizing the two basic properties of metastability, i.e., being locally uniform and temporarily fixed, an efficient implementation of the framework, called the LM algorithm, has been developed that can scalably mine communities hidden in large-scale networks. The effectiveness and efficiency of the LM algorithm have been theoretically analyzed as well as experimentally validated.
机译:网络社区指的是顶点组,在这些顶点组中,它们的连接链接密集,但在它们之间稀疏。网络社区挖掘问题(或简称NCMP)与从给定网络中查找所有此类社区有关。可以将各种应用程序表述为NCMP,范围从社交和/或生物网络分析到Web挖掘和搜索。到目前为止,已经开发了许多解决NCMP的算法,其中大多数都属于基于优化或启发式方法的类别。与现有研究不同,本文介绍的工作基于自然引入的随机模型的动态性,探索了网络社区及其属性的概念。本文利用大偏差理论建立了网络的分层社区结构与这种随机模型的局部混合特性之间的关系。关于网络中隐藏的社区结构的拓扑信息可以从其频谱特征中推断出来。基于上述关系,本工作提出了用于表征,分析和挖掘网络社区的通用框架。利用亚稳态的两个基本属性,即局部统一和临时固定,已开发出一种有效的框架实现方法,称为LM算法,可以大规模挖掘隐藏在大规模网络中的社区。 LM算法的有效性和效率已在理论上进行了分析和实验验证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号