首页> 外文期刊>Physica, A. Statistical mechanics and its applications >The (un)supervised NMF methods for discovering overlapping communities as well as hubs and outliers in networks
【24h】

The (un)supervised NMF methods for discovering overlapping communities as well as hubs and outliers in networks

机译:用于发现重叠社区以及网络中枢和离群点的(无)监督NMF方法

获取原文
获取原文并翻译 | 示例
           

摘要

For its crucial importance in the study of large-scale networks, many researchers devote to the detection of communities in various networks. It is now widely agreed that the communities usually overlap with each other. In some communities, there exist members that play a special role as hubs (also known as leaders), whose importance merits special attention. Moreover, it is also observed that some members of the network do not belong to any communities in a convincing way, and hence recognized as outliers. Failure to detect and exclude outliers will distort, sometimes significantly, the outcome of the detected communities. In short, it is preferable for a community detection method to detect all three structures altogether. This becomes even more interesting and also more challenging when we take the unsupervised assumption, that is, we do not assume the prior knowledge of the number K of communities. Our approach here is to define a novel generative model and formalize the detection of overlapping communities as well as hubs and outliers as an optimization problem on it. When K is given, we propose a normalized symmetric nonnegative matrix factorization algorithm based on Kullback-Leibler (KL) divergence to learn the parameters of the model. Otherwise, by combining KL divergence and prior model on parameters, we introduce another parameter learning method based on Bayesian symmetric nonnegative matrix factorization to learn the parameters of the model, while determining K. Therefore, we present a community detection method arguably in the most general sense, which detects all three structures altogether without prior knowledge of the number of communities. Finally, we test the proposed method on various real world networks. The experimental results, in contrast to several state-of-art algorithms, indicate its superior performance over other ones in terms of both clustering accuracy and community quality. (C) 2015 Elsevier B.V. All rights reserved.
机译:由于其在大规模网络研究中的至关重要性,许多研究人员致力于检测各种网络中的社区。现在,人们普遍同意,社区通常彼此重叠。在某些社区中,有些成员扮演着枢纽的特殊角色(也称为领导者),其重要性值得特别关注。此外,还观察到该网络的某些成员不具有说服力,不属于任何社区,因此被认为是异常值。如果无法检测到并排除异常值,有时会严重扭曲检测到的社区的结果。简而言之,对于社区检测方法而言,最好是全部检测全部三个结构。当我们采取无监督的假设时,这变得更加有趣并且也更具挑战性,也就是说,我们没有假设社区数量K的先验知识。我们在这里的方法是定义一个新颖的生成模型,并将重叠社区以及中心和离群值的检测形式化为对它的优化问题。当给出K时,我们提出了一种基于Kullback-Leibler(KL)散度的归一化对称非负矩阵分解算法,以学习模型的参数。否则,通过结合参数的KL散度和先验模型,我们引入了另一种基于贝叶斯对称非负矩阵分解的参数学习方法来学习模型的参数,同时确定K。因此,我们提出一种最普遍的社区检测方法感觉,它可以在不事先了解社区数量的情况下完全检测到所有三个结构。最后,我们在各种现实世界的网络上测试了该方法。与几种最新算法相比,实验结果表明,在聚类准确性和社区质量方面,其性能均优于其他算法。 (C)2015 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号