首页> 外文会议>International semantic web conference >Sparklify: A Scalable Software Component for Efficient Evaluation of SPARQL Queries over Distributed RDF Datasets
【24h】

Sparklify: A Scalable Software Component for Efficient Evaluation of SPARQL Queries over Distributed RDF Datasets

机译:Sparklify:可扩展的软件组件,用于有效评估分布式RDF数据集上的SPARQL查询

获取原文
获取外文期刊封面目录资料

摘要

One of the key traits of Big Data is its complexity in terms of representation, structure, or formats. One existing way to deal with it is offered by Semantic Web standards. Among them, RDF - which proposes to model data with triples representing edges in a graph - has received a large success and the semantically annotated data has grown steadily towards a massive scale. Therefore, there is a need for scalable and efficient query engines capable of retrieving such information. In this paper, we propose Sparklify: a scalable software component for efficient evaluation of SPARQL queries over distributed RDF datasets. It uses Sparqlify as a SPARQL-to-SQL rewriter for translating SPARQL queries into Spark executable code. Our preliminary results demonstrate that our approach is more extensible, efficient, and scalable as compared to state-of-the-art approaches. Sparklify is integrated into a larger SANSA framework and it serves as a default query engine and has been used by at least three external use scenarios.
机译:大数据的主要特征之一是其表示形式,结构或格式的复杂性。语义Web标准提供了一种现有的处理方法。其中,RDF-提出用代表图表边缘的三元组来建模数据-已获得了巨大的成功,并且带有语义注释的数据已朝着大规模稳步增长。因此,需要能够检索此类信息的可伸缩且有效的查询引擎。在本文中,我们提出了Sparklify:可扩展的软件组件,用于在分布式RDF数据集上有效评估SPARQL查询。它使用Sparqlify作为SPARQL-to-SQL重写器,将SPARQL查询转换为Spark可执行代码。我们的初步结果表明,与最新方法相比,我们的方法更具扩展性,效率和可扩展性。 Sparklify已集成到一个较大的SANSA框架中,它用作默认查询引擎,并且至少已在三种外部使用场景中使用过。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号