首页> 外文会议>Data Engineering, ICDE, 2009 IEEE 25th International Conference on >Efficient Indices Using Graph Partitioning in RDF Triple Stores
【24h】

Efficient Indices Using Graph Partitioning in RDF Triple Stores

机译:在RDF三重存储中使用图分区的有效索引

获取原文
获取外文期刊封面目录资料

摘要

With the advance of the Semantic Web, varying RDF data were increasingly generated, published, queried, and reused via the Web. For example, the DBpedia, a community effort to extract structured data from Wikipedia articles, broke 100 million RDF triples in its latest release. Initiated by Tim Berners-Lee,likewise, the Linking Open Data (LOD) project has published and interlinked many open licence datasets which consisted of over 2 billion RDF triples so far. In this context, fast query response over such large scaled data would be one of the challenges to existing RDF data stores. In this paper, we propose a novel triple indexing scheme to help RDF query engine fast locate the instances within a small scope. By considering the RDF data as a graph, we would partition the graph into multiple subgraph pieces and store them individually, over which a signature tree would be built up to index the URIs. When a query arrives, the signature tree index is used to fast locate the partitions that might include the matches of the query by its constant URIs. Our experiments indicate that the indexing scheme dramatically reduces the query processing time in most cases because many partitions would be early filtered out and the expensive exact matching is only performed over a quite small scope against the original dataset.
机译:随着语义Web的发展,越来越多的RDF数据通过Web生成,发布,查询和重用。例如,社区致力于从Wikipedia文章中提取结构化数据的DBpedia在其最新版本中突破了1亿个RDF三倍。同样,由蒂姆·伯纳斯·李(Tim Berners-Lee)发起,链接开放数据(LOD)项目已经发布并互连了许多开放许可证数据集,迄今为止,这些数据集包括超过20亿个RDF三元组。在这种情况下,对如此大规模的数据进行快速查询响应将是现有RDF数据存储的挑战之一。在本文中,我们提出了一种新颖的三重索引方案,以帮助RDF查询引擎在较小范围内快速定位实例。通过将RDF数据视为图形,我们可以将图形划分为多个子图并分别存储,然后在其上构建签名树来为URI编制索引。查询到达时,签名树索引用于通过其常量URI快速定位可能包含查询匹配项的分区。我们的实验表明,在大多数情况下,索引方案显着减少了查询处理时间,因为许多分区将被尽早过滤掉,而昂贵的精确匹配仅在很小的范围内针对原始数据集执行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号