首页> 外文会议>ACM SIGMOD international conference on management of data >Finding Maximum Degrees in Hidden Bipartite Graphs
【24h】

Finding Maximum Degrees in Hidden Bipartite Graphs

机译:在隐藏的二分钟内找到最大程度

获取原文

摘要

An (edge) hidden graph is a graph whose edges are not explicitly given. Detecting the presence of an edge requires expensive edge-probing queries. We consider the k most connected vertex problem on hidden bipartite graphs. Specifically, given a bipartite graph G with independent vertex sets B and W, the goal is to find the k vertices in B with the largest degrees using the minimum number of queries. This problem can be regarded as a top-k extension of a semi-join, and is encountered in many applications in practice (e.g., top-k spatial join with arbitrarily complex join predicates). If B and W have n and m vertices respectively, the number of queries needed to solve the problem is nm in the worst case. This, however, is a pessimistic estimate on how many queries are necessary on practical data. In fact, on some easy inputs, the problem can be efficiently settled with only km+n edges, which is significantly lower than nm for k 《 n. The huge difference between km + n and nm makes it interesting to design an adaptive algorithm that is guaranteed to achieve the best possible performance on every input G. We give such an algorithm, and prove that it is instance optimal among a broad class of solutions. This means that, for any G, our algorithm can perform more queries than the optimal solution (which is currently unknown) by only a constant factor, which can be shown to be at most 2. Extensive experiments demonstrate that, in practice, the number of queries required by our technique is far less than nm, and agrees with our theoretical findings very well.
机译:(边缘)隐藏图是一个图形未明确给出的图形。检测边缘的存在需要昂贵的边缘探测查询。我们考虑隐藏的二角形图中的K大多数连接的顶点问题。具体地,给定具有独立顶点集B和W的二分图G,目标是使用最小查询数量的B中的k个顶点。此问题可以被视为半连接的顶级k扩展,并且在实践中的许多应用中遇到(例如,Top-K空间加入与任意复杂的加入谓词)。如果B和W分别具有N和M顶点,则在最坏的情况下解决问题所需的查询数量是NM。然而,这是对实际数据所必需的许多查询的悲观估算。事实上,在一些容易的输入上,问题可以有效地稳定,只有KM + n边缘,这显着低于K“n的NM。 KM + N和NM之间的巨大差异使得设计一个有趣的设计算法,保证在每个输入G上实现最佳性能。我们提供了这样的算法,并证明它是广泛的解决方案中的实例最佳。这意味着,对于任何G,我们的算法可以通过仅通过恒定因子来执行比最佳解决方案(当前未知)更加疑问,这可以显示最多2.大量实验表明,在实践中,在实践中表明该数量我们技术要求的查询远远低于NM,并同意我们的理论发现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号