...
首页> 外文期刊>Complexity >Supergraph Topology Feature Index for Personalized Interesting Subgraph Query in Large Labeled Graphs
【24h】

Supergraph Topology Feature Index for Personalized Interesting Subgraph Query in Large Labeled Graphs

机译:超大标记图中个性化有趣的子图查询的超图拓扑功能索引

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Interesting subgraph query aims to find subgraphs that are isomorphic to the given query graph from a data graph and rank the subgraphs according to their interestingness scores. However, the existing subgraph query approaches are inefficient when dealing with large-scale labeled data graph. This is caused by the following problems: (i) the existing work mainly focuses on unweighted query graphs, while ignoring the impact of query constraints on query results. (ii) Excessive number of subgraph candidates or complex joins between nodes in the subgraph candidates reduce the query efficiency. To solve these problems, this paper proposes an intelligent solution. Firstly, an Isotype Structure Graph Compression (ISGC) strategy is proposed to compress similar nodes in a graph to reduce the size of the graph and avoid unnecessary matching. Then, an auxiliary data structure Supergraph Topology Feature Index (STFIndex) is designed to replace the storage of the original data graph and improve the efficiency of an online query. After that, a partition method based on Edge Label Step Value (ELSV) is proposed to partition the index logically. In addition, a novel Top-K interest subgraph query approach is proposed, which consists of the multidimensional filtering (MDF) strategy, upper bound value (UBV) (Size-c) matching, and the optimizational join (QJ) method to filter out as many false subgraph candidates as possible to achieve fast joins. We conduct experiments on real and synthetic datasets. Experimental results show that the average performance of our approach is 1.35 higher than that of the state-of-the-art approaches when the query graph is unweighted, and the average performance of our approach is 2.88 higher than that of the state-of-the-art approaches when the query graph is weighted.
机译:关注子查询旨在发现是同构的,从数据图给定的查询图,并根据自己的兴趣度分数排在子图子图。然而,随着大型标记的数据曲线图处理时,现有的子图的查询方法是低效的。这是由以下问题引起的:(i)根据现有的工作主要集中在加权查询图表,而忽略了查询约束对查询结果的影响。 (ⅱ)子图的候选数量过多或在子图的候选节点之间的复杂的连接减少查询效率。为了解决这些问题,本文提出的智能解决方案。首先,同种型结构格拉夫压缩(ISGC)的策略,提出了压缩类似的节点中的曲线图,以减少图的大小和避免不必要的匹配。然后,辅助数据结构母图拓扑要素指数(STFIndex)被设计成替换原始数据曲线图的存储和提高在线查询的效率。在此之后,基于边缘标签步长值(ELSV)分区方法,提出了进行逻辑分区的索引。另外,一种新型的前K个兴趣子查询方法提出,它由多维滤波(MDF)的策略,上界值(UBV)(尺寸-c)的匹配,并且最优性充分加入(QJ)方法筛选出的许多假子如考生能够实现高速连接。我们进行真实和合成数据集的实验。实验结果表明,我们的方法的平均性能比国家的最先进的,当查询图是不加权的方法,和我们的方法的平均性能比高出2.88了最先进的高1.35当查询图进行加权最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号