首页> 外文期刊>Future generation computer systems >Combination of links and node contents for community discovery using a graph regularization approach
【24h】

Combination of links and node contents for community discovery using a graph regularization approach

机译:使用图正则化方法组合链接和节点内容以进行社区发现

获取原文
获取原文并翻译 | 示例
           

摘要

With the rapid growth of the networked data, the study of community detection is drawing increasing attention of researchers. A number of algorithms have been proposed and some of them have been well applied in many research fields, such as recommendation systems, information retrieval, etc. Traditionally, the community detection methods mainly use the knowledge of the topological structure which contains the most important clue for finding potential groups or communities. However, as we know, a wealth of content information exists on the nodes in real-world networks, and may help for community detection. Considering the above problem, we introduce a novel community detection method under the framework of nonnegative matrix factorization (NMF), and adopt the idea that two nodes with similar content will be most likely to belong to the same community to achieve the incorporation of links and node contents, i.e., we employ a graph regularization to penalize the dissimilarity of nodes denoted by community memberships. Besides, we introduce an intuitive manifold learning strategy to recover the intrinsic geometrical structure of the content information, i.e., K-near neighbor consistency. In addition, we found that, there are still drawbacks in this framework due to it does not consider the heterogeneous distribution of node degrees. This heterogeneous distribution can affect the function of graph regularization and isolates the original community memberships. We first proposed the node popularities satisfying the above interpretation and develop a new NMF-based model, named as Combination of Links and Node Contents for Community Discovery (CLNCCD). The experiments on both artificial and real-world networks compared with the state-of-the-art methods show that, the new model obtains significant improvement for community detection by incorporating node contents effectively. (C) 2018 Published by Elsevier B.V.
机译:随着网络数据的快速增长,社区检测的研究日益引起研究人员的关注。已经提出了许多算法,其中一些算法已经在推荐系统,信息检索等许多研究领域中得到了很好的应用。传统上,社区检测方法主要利用包含最重要线索的拓扑结构知识。寻找潜在的群体或社区。但是,众所周知,现实网络中的节点上存在大量的内容信息,可能有助于社区发现。考虑到上述问题,我们在非负矩阵分解(NMF)框架下引入了一种新颖的社区检测方法,并采用了内容相似的两个节点最有可能属于同一个社区的思想来实现链接和链接的合并。节点内容,即,我们使用图正则化来惩罚由社区成员资格表示的节点的不相似性。此外,我们引入了一种直观的流形学习策略来恢复内容信息的内在几何结构,即K-近邻一致性。另外,我们发现,由于该框架没有考虑节点度的异构分布,因此在该框架中仍然存在缺陷。这种异构分布会影响图正则化的功能并隔离原始社区成员身份。我们首先提出了满足以上解释的节点流行度,并开发了一种基于NMF的新模型,称为链接和社区发现节点内容组合(CLNCCD)。在人工和现实网络上进行的实验与最新技术的比较表明,新模型通过有效地合并节点内容,在社区检测方面取得了显着改进。 (C)2018由Elsevier B.V.发布

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号