【24h】

Effectively Explaining Missing Nodes on Structural Graph Clustering

机译:有效地解释结构图聚类上的缺失节点

获取原文

摘要

To effectively manage and analyze graph data in many real-life graph applications, SCAN algorithm can efficiently help users to detect useful clusters, such as social networks, communication networks, gene networks, and so on. However, dirty data exist in graph data, for example, some edges are missing in graph data. In this case, the clustering results of SCAN over the dirty graph data cannot meet the users' requirements, such as, the missing nodes are not included in the desired clusters. To address this kind of problem, in this paper, we explore an effective explanation model to make the missing nodes be included in the desired clusters. The problem of explaining missing nodes in light of data modification is to explain why the missing nodes are not included in the desired clusters and how to make the missing nodes appear in the corresponding desired clusters by modifying the original graph dataset. To achieve this purpose, first, the clustering rational of SCAN algorithm is analyzed, and an unified explanation framework is proposed based on the analysis above. Moreover, based on the common explanation principle, the original clustering results should be maintained as much as possible in the new clustering results of SCAN, we design a penalty function to achieve this purpose. Then, we propose two explanation algorithms for making the missing nodes appear in the desired clusters by modifying the original graph dataset with minimum penalty value. Finally, comprehensive experiments are conducted, which demonstrate that the explored explanation model can efficiently explain the missing nodes on structural graph clustering.
机译:为了有效地管理和分析在许多真实图表应用中的图形数据,扫描算法可以有效地帮助用户检测有用的集群,例如社交网络,通信网络,基因网络等。但是,图形数据中存在脏数据,例如,图形数据中缺少一些边缘。在这种情况下,脏图数据扫描的聚类结果不能满足用户的要求,例如,缺失的节点不包括在所需的群集中。为了解决这种问题,在本文中,我们探讨了一个有效的解释模型,使丢失的节点包含在所需的集群中。解释缺少数据修改的缺失节点的问题是解释为什么丢失的节点不包括在所需的群集中,以及如何通过修改原始图数据集来使缺失的节点出现在相应的期望群集中。为达到此目的,首先,分析了扫描算法的聚类合理性,并且基于上述分析提出了统一的解释框架。而且,基于常见的解释原理,应在新的聚类扫描结果中尽可能多地维持原始聚类结果,我们设计了达到此目的的惩罚功能。然后,我们提出了两个解释算法,通过修改具有最小损失值的原始图形数据集来使缺失节点出现在所需的群集中。最后,进行了综合实验,表明探索的解释模型可以有效地解释结构图聚类上的缺失节点。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号