【24h】

Distributed Efficient Provenance-Aware Regular Path Queries on Large RDF Graphs

机译:大型RDF图上的分布式高效起源感知常规路径查询

获取原文
获取外文期刊封面目录资料

摘要

With the proliferation of knowledge graphs, massive RDF graphs have been published on the Web. As an essential type of queries for RDF graphs, Regular Path Queries (RPQs) have been attracting increasing research efforts. However, the existing query processing approaches mainly focus on the standard semantics of RPQs. which cannot provide provenance of the answer sets. We propose dProvRPQ that is a distributed approach to evaluating provenance-aware RPQs over big RDF graphs. Our Pregel-based method employs Glushkov automata to keep track of matching processes of RPQs in parallel. Meanwhile, four optimization strategies are devised, including edge filtering, candidate states, message compression, and message selection, which can reduce the intermediate results of the basic dProvRPQ algorithm dramatically and overcome the counting-paths problem to some extent. The proposed algorithms are verified by extensive experiments on both synthetic and real-world datasets, which show that our approach can efficiently answer the provenance-aware RPQs over large RDF graphs.
机译:随着知识图的增长,大量的RDF图已经发布在Web上。作为RDF图的基本查询类型,规则路径查询(RPQ)一直吸引着越来越多的研究工作。但是,现有的查询处理方法主要集中在RPQ的标准语义上。不能提供答案集的来源。我们建议使用dProvRPQ,它是一种用于评估大型RDF图上可识别出处的RPQ的分布式方法。我们基于Pregel的方法采用Glushkov自动机来并行跟踪RPQ的匹配过程。同时,设计了四种优化策略,包括边缘过滤,候选状态,消息压缩和消息选择,可以显着减少基本dProvRPQ算法的中间结果,并在一定程度上克服了计数路径问题。通过对合成数据集和真实数据集进行的大量实验验证了所提出的算法,这表明我们的方法可以有效地解决大型RDF图上的出处识别RPQ。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号