【24h】

Presto-RDF: SPARQL Querying over Big RDF Data

机译:Presto-RDF:SPARQL查询大RDF数据

获取原文

摘要

There has been a rapid increase in the amount of Resource Description Framework (RDF) data on the web. The processing of large volumes of RDF data requires an efficient storage and query-processing engine that can scale well with the volume of data. In the past two and half years, however, heavy users of big data systems, like Facebook, noted limitations with the query performance of these big data systems and began to develop new distributed query engines for big data that do not rely on map-reduce. Facebook's Presto is one such example. This paper proposes an architecture based on Presto, called Presto-RDF, that can be used to process big RDF data. An evaluation of performance of Presto in processing big RDF data against Apache Hive is also presented. The results of the experiments show that Presto-RDF framework has a much higher performance than Apache Hive and native RDF store - 4Store and it can be used to process big RDF data.
机译:Web上的资源描述框架(RDF)数据量迅速增加。处理大量RDF数据需要一个有效的存储和查询处理引擎,该引擎可以很好地随数据量扩展。但是,在过去的两年半中,像Facebook这样的大数据系统的重度用户注意到了这些大数据系统的查询性能的局限性,并开始为不依赖于map-reduce的大数据开发新的分布式查询引擎。 。 Facebook的Presto就是一个这样的例子。本文提出了一种基于Presto的架构,称为Presto-RDF,可用于处理大型RDF数据。还介绍了Presto在针对Apache Hive处理大型RDF数据时的性能评估。实验结果表明,Presto-RDF框架比Apache Hive和本机RDF存储-4Store具有更高的性能,可用于处理大型RDF数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号