首页> 外文会议>Biomedical data management and graph online querying >S2X: Graph-Parallel Querying of RDF with GraphX
【24h】

S2X: Graph-Parallel Querying of RDF with GraphX

机译:S2X:使用GraphX进行RDF的图并行查询

获取原文
获取原文并翻译 | 示例

摘要

RDF has constantly gained attention for data publishing due to its flexible data model, raising the need for distributed querying. However, existing approaches using general-purpose cluster frameworks employ a record-oriented perception of RDF ignoring its inherent graph-like structure. Recently, GraphX was published as a graph abstraction on top of Spark, an in-memory cluster computing system. It allows to seamlessly combine graph-parallel and data-parallel computation in a single system, an unique feature not available in other systems. In this paper we introduce S2X, a SPARQL query processor for Hadoop where we leverage this unified abstraction by implementing basic graph pattern matching of SPARQL as a graph-parallel task while other operators are implemented in a datarparallel manner. To the best of our knowledge, this is the first approach to combine graph-parallel and data-parallel computation for SPARQL querying of RDF data based on Hadoop.
机译:RDF由于其灵活的数据模型而一直引起数据发布的关注,从而增加了对分布式查询的需求。但是,使用通用集群框架的现有方法在忽略RDF固有的类似于图形的结构的情况下采用了RDF的面向记录的感知。最近,GraphX作为图形抽象发布在内存中的群集计算系统Spark之上。它允许在单个系统中无缝组合图形并行和数据并行计算,这是其他系统中不具备的独特功能。在本文中,我们介绍了S2X,这是一种用于Hadoop的SPARQL查询处理器,在其中我们通过实现SPARQL的基本图形模式匹配作为图并行任务来利用这种统一抽象,而其他运算符以数据并行方式来实现。据我们所知,这是将图并行计算和数据并行计算相结合用于基于Hadoop的RDF数据的SPARQL查询的第一种方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号