Venue: International Conference on Very Large Data Bases (VLDB 2008)

Hexastore: Sextuple Indexing for Semantic Web Data Management


Abstract

Despite the intense interest in realizing the Semantic Web vision, most existing RDF data management schemes are constrained in terms of efficiency and scalability. Still, the growing popularity of the RDF format arguably calls for an effort to offset these drawbacks. Viewed from a relational-database perspective, these constraints derive from the very nature of the RDF data model, which is based on a triple format. Recent research has attempted to address these constraints using a vertical-partitioning approach, in which a separate two-column table is constructed for each property. However, as we show, this approach suffers from similar scalability drawbacks on queries that are not bound by RDF property value. In this paper, we propose an RDF storage scheme that uses the triple nature of RDF as an asset. This scheme enhances the vertical-partitioning idea and takes it to its logical conclusion. RDF data is indexed in six possible ways, one for each possible ordering of the three RDF elements. Each instance of an RDF element is associated with two vectors; each such vector gathers elements of one of the other types, along with lists of the third-type resources attached to each vector element. Hence, a sextuple-indexing scheme emerges. This format allows for quick and scalable general-purpose query processing; it confers significant advantages (up to five orders of magnitude) compared to previous approaches for RDF data management, at the price of a worst-case five-fold increase in index space. We experimentally document the advantages of our approach on real-world and synthetic data sets with practical queries.
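
The six-way indexing idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the class name `SixWayTripleIndex`, its methods, and the sample data are hypothetical, and nested dictionaries and sets stand in for the vectors and lists the abstract describes. The sketch stores every terminal set separately and makes no attempt to reproduce the space bound or the performance figures stated above.

```python
from collections import defaultdict
from itertools import permutations


class SixWayTripleIndex:
    """Toy in-memory index over (subject, property, object) triples.

    Hypothetical illustration only; not the Hexastore implementation.
    """

    # One index per ordering of the three elements:
    # spo, sop, pso, pos, osp, ops
    ORDERINGS = ["".join(p) for p in permutations("spo")]

    def __init__(self):
        # index[ordering][first][second] -> set of third elements
        self.index = {
            name: defaultdict(lambda: defaultdict(set))
            for name in self.ORDERINGS
        }

    def add(self, s, p, o):
        """Insert one triple into all six orderings."""
        triple = {"s": s, "p": p, "o": o}
        for name in self.ORDERINGS:
            a, b, c = (triple[k] for k in name)
            self.index[name][a][b].add(c)

    def match(self, s=None, p=None, o=None):
        """Return sorted triples matching a pattern; None is a wildcard."""
        bound = {"s": s, "p": p, "o": o}
        # Pick the ordering whose leading positions are the bound ones,
        # so the lookup starts from known keys (sorted() is stable).
        name = "".join(sorted("spo", key=lambda k: bound[k] is None))
        a_key, b_key, c_key = name
        level1 = self.index[name]
        results = []
        firsts = level1.keys() if bound[a_key] is None else [bound[a_key]]
        for a in firsts:
            level2 = level1.get(a, {})  # .get avoids creating empty entries
            seconds = level2.keys() if bound[b_key] is None else [bound[b_key]]
            for b in seconds:
                for c in level2.get(b, set()):
                    if bound[c_key] is None or bound[c_key] == c:
                        found = {a_key: a, b_key: b, c_key: c}
                        results.append((found["s"], found["p"], found["o"]))
        return sorted(results)


# Usage with made-up data: a query that binds only the object
# is answered from the "osp" ordering rather than by scanning.
idx = SixWayTripleIndex()
idx.add("alice", "knows", "bob")
idx.add("alice", "likes", "tea")
idx.add("carol", "knows", "bob")
print(idx.match(o="bob"))
```

Because every ordering is materialized, a pattern with any combination of bound elements has an index whose leading keys are exactly those elements, which is the property that lets queries not bound by property value avoid a full scan in this toy model.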
