首页> 美国卫生研究院文献>other >Any-k: Anytime Top-k Tree Pattern Retrieval in Labeled Graphs
【2h】

Any-k: Anytime Top-k Tree Pattern Retrieval in Labeled Graphs

机译:Any-k:随时在标签图中检索前k个树型

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Many problems in areas as diverse as recommendation systems, social network analysis, semantic search, and distributed root cause analysis can be modeled as pattern search on labeled graphs (also called “heterogeneous information networks” or HINs). Given a large graph and a query pattern with node and edge label constraints, a fundamental challenge is to find the top-k matches according to a ranking function over edge and node weights. For users, it is difficult to select value k. We therefore propose the novel notion of an any-k ranking algorithm: for a given time budget, return as many of the top-ranked results as possible. Then, given additional time, produce the next lower-ranked results quickly as well. It can be stopped anytime, but may have to continue until all results are returned. This paper focuses on acyclic patterns over arbitrary labeled graphs. We are interested in practical algorithms that effectively exploit (1) properties of heterogeneous networks, in particular selective constraints on labels, and (2) that the users often explore only a fraction of the top-ranked results. Our solution, KARPET, carefully integrates aggressive pruning that leverages the acyclic nature of the query, and incremental guided search. It enables us to prove strong non-trivial time and space guarantees, which is generally considered very hard for this type of graph search problem. Through experimental studies we show that KARPET achieves running times in the order of milliseconds for tree patterns on large networks with millions of nodes and edges.
机译:可以将诸如推荐系统,社交网络分析,语义搜索和分布式根本原因分析等领域中的许多问题建模为标记图(也称为“异构信息网络”或HIN)上的模式搜索。给定一个大图和具有节点和边缘标签约束的查询模式,一个基本的挑战是根据边缘和节点权重的排名函数找到前k个匹配项。对于用户而言,很难选择值k。因此,我们提出了“任意k排序”算法的新颖概念:对于给定的时间预算,返回尽可能多的排名靠前的结果。然后,在给定额外时间的情况下,也可以快速产生下一个排名较低的结果。它可以随时停止,但可能必须继续直到返回所有结果。本文着重于任意标记图上的非循环模式。我们对有效利用(1)异构网络的属性,特别是标签的选择性约束,以及(2)用户经常仅浏览排名靠前的结果的一小部分的实用算法感兴趣。我们的解决方案KARPET精心集成了积极的修剪功能,该功能利用了查询的非循环性质和增量引导搜索。它使我们能够证明强大的非平凡时间和空间保证,对于这种类型的图搜索问题,通常很难做到这一点。通过实验研究,我们发现,在具有数百万个节点和边缘的大型网络上,KARPET的树形模式可实现毫秒级的运行时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号