首页> 外文期刊>ACM transactions on the web >Top-k User-Defined Vertex Scoring Queries in Edge-Labeled Graph Databases
【24h】

Top-k User-Defined Vertex Scoring Queries in Edge-Labeled Graph Databases

机译:边缘标签图数据库中的前k个用户定义的顶点计分查询

获取原文
获取原文并翻译 | 示例

摘要

We consider identifying highly ranked vertices in large graph databases such as social networks or the Semantic Web where there are edge labels. There are many applications where users express scoring queries against such databases that involve two elements: (i) a set of patterns describing relationships that a vertex of interest to the user must satisfy and (ii) a scoring mechanism in which the user may use properties of the vertex to assign a score to that vertex. We define the concept of a partial pattern map query (partial PM-query), which intuitively allows us to prune partial matchings, and show that finding an optimal partial PM-query is NP-hard. We then propose two algorithms, PScore_LP and PScore_NWST, to find the answer to a scoring (top-k) query. In PScore_LP, the optimal partial PM-query is found using a list-oriented pruning method. PScore_NWST leverages node-weighted Steiner trees to quickly compute slightly sub-optimal solutions. We conduct detailed experiments comparing our algorithms with (i) an algorithm (PScore_Base) that computes all answers to the query, evaluates them according to the scoring method, and chooses the top-k, and (ii) two Semantic Web query processing systems (Jena and Graph DB). Our algorithms show better performance than PScore_Base and the Semantic Web query processing systems-moreover, PScore_NWST outperforms PScore_LP on large queries and on queries with a tree structure.
机译:我们考虑在大型图形数据库(例如具有边缘标签的社交网络或语义网)中识别高度排序的顶点。在许多应用程序中,用户对包含两个元素的数据库表达评分查询:(i)一组描述用户感兴趣的顶点必须满足的关系的模式;(ii)用户可以使用属性的评分机制顶点的分数来为该顶点分配分数。我们定义了部分模式图查询(部分PM查询)的概念,该概念直观地使我们能够修剪部分匹配,并表明找到最佳的部分PM查询是NP难的。然后,我们提出两种算法PScore_LP和PScore_NWST,以找到得分(前k位)查询的答案。在PScore_LP中,使用面向列表的修剪方法找到最佳的局部PM查询。 PScore_NWST利用节点加权的Steiner树快速计算稍微次优的解决方案。我们进行了详细的实验,将我们的算法与(i)一种算法(PScore_Base)进行比较,该算法计算查询的所有答案,并根据计分方法对它们进行评估,然后选择前k个,以及(ii)两个语义Web查询处理系统( Jena和Graph DB)。我们的算法显示出比PScore_Base和语义Web查询处理系统更好的性能-此外,在大型查询和具有树结构的查询上,PScore_NWST优于PScore_LP。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号