【24h】

Exploring Controlled RDF Distribution

机译:探索受控的RDF分布

获取原文

摘要

RDF datasets have increased rapidly over the last few years. In order to process SPARQL queries on these large datasets, much effort has been spent on developing horizontally scalable techniques, which involve data partitioning and parallel query processing. While distribution may provide storage scalability, it may also incur high communication costs for processing queries. In this paper, we present a parallel and distributed query rocessing approach that explores the existence of data allocation patterns, provided by a controlled data distribution, that determine how RDF triples should be grouped and stored on the same server. Fragments of the RDF datastore follow a given allocation pattern and correspond also to units of communication among servers. Based on this distribution model, we define two communication strategies for query processing: get-frag, which requests remote servers to send fragments that contain data required by a query, and send-result, which forwards intermediate results. These strategies are combined on a method, called 2ways, that chooses the adequate communication strategy whenever queries traverse fragment boundaries. We provide a cost function used to determine this choice and present experimental results. They show that our proposed technique effectively reduces the communication cost and improves the response time for processing SPARQL queries on a distributed RDF datastore.
机译:RDF数据集在过去几年中迅速增长。为了处理这些大型数据集上的SPARQL查询,已经花费了很多精力来开发水平可伸缩的技术,该技术涉及数据分区和并行查询处理。尽管分发可以提供存储可伸缩性,但是它也可能会导致处理查询的通信成本较高。在本文中,我们提出了一种并行且分布式的查询处理方法,该方法探索了由受控数据分发提供的数据分配模式的存在,该模式确定了应如何将RDF三元组分组并存储在同一服务器上。 RDF数据存储区的片段遵循给定的分配模式,并且还对应于服务器之间的通信单元。基于此分发模型,我们定义了两种用于查询处理的通信策略:get-frag(请求远程服务器发送包含查询所需数据的片段)和send-result(转发中间结果)。这些策略结合在称为2way的方法上,该方法在查询遍历片段边界时选择适当的通信策略。我们提供了一个成本函数,用于确定这一选择并提供实验结果。他们表明,我们提出的技术有效地降低了通信成本,并改善了在分布式RDF数据存储上处理SPARQL查询的响应时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号