首页> 外文会议>IEEE international conference on data engineering >Top-K interesting subgraph discovery in information networks
【24h】

Top-K interesting subgraph discovery in information networks

机译:信息网络中的前K个有趣的子图发现

获取原文

摘要

In the real world, various systems can be modeled using heterogeneous networks which consist of entities of different types. Many problems on such networks can be mapped to an underlying critical problem of discovering top-K subgraphs of entities with rare and surprising associations. Answering such subgraph queries efficiently involves two main challenges: (1) computing all matching subgraphs which satisfy the query and (2) ranking such results based on the rarity and the interestingness of the associations among entities in the subgraphs. Previous work on the matching problem can be harnessed for a naïve ranking-after-matching solution. However, for large graphs, subgraph queries may have enormous number of matches, and so it is inefficient to compute all matches when only the top-K matches are desired. In this paper, we address the two challenges of matching and ranking in top-K subgraph discovery as follows. First, we introduce two index structures for the network: topology index, and graph maximum metapath weight index, which are both computed offline. Second, we propose novel top-K mechanisms to exploit these indexes for answering interesting subgraph queries online efficiently. Experimental results on several synthetic datasets and the DBLP and Wikipedia datasets containing thousands of entities show the efficiency and the effectiveness of the proposed approach in computing interesting subgraphs.
机译:在现实世界中,可以使用由不同类型的实体组成的异构网络对各种系统进行建模。此类网络上的许多问题都可以映射到一个基本的关键问题,即发现具有罕见且令人惊讶的关联的实体的前K个子图。有效地回答此类子图查询涉及两个主要挑战:(1)计算满足查询的所有匹配子图;(2)基于子图中实体之间关联的稀有性和趣味性对此类结果进行排名。可以利用以前在匹配问题上的工作来获得幼稚的“匹配后排名”解决方案。但是,对于大型图,子图查询可能具有大量匹配项,因此,当仅需要前K个匹配项时,计算所有匹配项效率很低。在本文中,我们解决了在前K个子图发现中匹配和排名的两个挑战,如下所述。首先,我们为网络引入两个索引结构:拓扑索引和图形最大元路径权重索引,它们都是离线计算的。其次,我们提出了新颖的top-K机制来利用这些索引来有效地在线回答有趣的子图查询。在几个合成数据集以及包含成千上万个实体的DBLP和Wikipedia数据集上的实验结果表明,该方法在计算有趣的子图中的效率和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号