首页> 外文会议>IEEE International Congress on Big Data >Rewriting complex SPARQL analytical queries for efficient cloud-based processing
【24h】

Rewriting complex SPARQL analytical queries for efficient cloud-based processing

机译:重写复杂的SPARQL分析查询,以了解基于高效的基于云的处理

获取原文

摘要

Many emerging Semantic Web applications combine and aggregate data across domains for analysis. Such analytical queries compute aggregates over multiple groupings of data, resulting in query plans with complex grouping-aggregation constraints. In the context of an RDF analytical query, each such grouping maps to a graph pattern subquery with multiple join operations, and related groups often result in overlapping graph patterns within the same query. In this paper, we propose a holistic approach to optimize RDF analytical queries by refactoring queries to achieve shared execution of common subexpressions that enables parallel evaluation of groupings as well as aggregations. Such a rewriting enables shorter execution workflows, particularly beneficial for scale-out processing on distributed Cloud systems with multiple I/O phases. Experiments on real-world and synthetic benchmarks confirm that such a rewriting can achieve more efficient execution plans when compared to relational-style SPARQL query plans executed on popular (Cloud systems.
机译:许多新兴语义Web应用程序组合和聚合域的数据以进行分析。此类分析查询在多个数据分组上计算聚合,导致具有复杂分组聚合约束的查询计划。在RDF分析查询的上下文中,每个这样的分组映射到具有多个连接操作的图形模式子查询,并且相关组经常导致同一查询内的重叠图形模式。在本文中,我们提出了一种整体方法,通过重构查询来实现RDF分析查询,以实现共享的常见子表达式的共享执行,这使得可以并行评估分组以及聚合。这种重写使得能够更短的执行工作流,特别有利于具有多个I / O阶段的分布式云系统上的扩展处理。与在流行(云系统上的关系风格的SPARQL查询计划相比,如今,在现实世界和合成基准测试中确认此类重写可以实现更高效的执行计划。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号