首页> 外文会议>International conference on knowledge science, engineering and management >ROSIE: Runtime Optimization of SPARQL Queries over RDF Using Incremental Evaluation
【24h】

ROSIE: Runtime Optimization of SPARQL Queries over RDF Using Incremental Evaluation

机译:ROSIE:使用增量评估的RDF上SPARQL查询的运行时优化

获取原文

摘要

RDF (Resource Description Framework) is a proposed standard for knowledge representation, with relational databases wildly adopted in RDF data management. For efficient evaluation of SPARQL queries over RDF data, the legacy query optimizer needs reconsiderations. One vital problem is how to tackle the suboptimal query plan caused by error-prone cardinality estimation. For RDF data, determine an optimal execution order before the query actually evaluated is costly, or even infeasible. In this paper, we propose ROSIE, a Runtime Optimization framework that iteratively re-optimize SPARQL query plan according to the actual cardinality derived from Incremental partial query Evaluation. By introducing an approach for heuristic-based plan generation, as well as a mechanism to detect cardinality estimation error at runtime, ROSIE relieves the problem of biased cardinality propagation in an efficient way. Extensive experiments on real and benchmark data have shown that, compared to the state-of-the-arts, ROSIE consistently outperformed on complex queries by orders of magnitude.
机译:RDF(资源描述框架)是提出的知识表示标准,RDF数据管理中广泛采用了关系数据库。为了对RDF数据进行SPARQL查询的有效评估,旧版查询优化器需要重新考虑。一个重要的问题是如何解决由于容易出错的基数估计而导致的次优查询计划。对于RDF数据,在实际评估的查询代价高昂甚至不可行之前,请确定最佳执行顺序。在本文中,我们提出了ROSIE,这是一个运行时优化框架,该框架根据从增量式局部查询评估得出的实际基数来迭代地重新优化SPARQL查询计划。通过引入一种基于启发式计划生成的方法,以及一种在运行时检测基数估计错误的机制,ROSIE可以有效地缓解基数有偏向传播的问题。在真实数据和基准数据上进行的大量实验表明,与最新技术相比,ROSIE在复杂查询上的性能始终比其高出几个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号