ROSIE: Runtime Optimization of SPARQL Queries over RDF Using Incremental Evaluation

机译：ROSIE：使用增量评估的RDF上SPARQL查询的运行时优化

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

RDF (Resource Description Framework) is a proposed standard for knowledge representation, with relational databases wildly adopted in RDF data management. For efficient evaluation of SPARQL queries over RDF data, the legacy query optimizer needs reconsiderations. One vital problem is how to tackle the suboptimal query plan caused by error-prone cardinality estimation. For RDF data, determine an optimal execution order before the query actually evaluated is costly, or even infeasible. In this paper, we propose ROSIE, a Runtime Optimization framework that iteratively re-optimize SPARQL query plan according to the actual cardinality derived from Incremental partial query Evaluation. By introducing an approach for heuristic-based plan generation, as well as a mechanism to detect cardinality estimation error at runtime, ROSIE relieves the problem of biased cardinality propagation in an efficient way. Extensive experiments on real and benchmark data have shown that, compared to the state-of-the-arts, ROSIE consistently outperformed on complex queries by orders of magnitude.

机译：RDF（资源描述框架）是提出的知识表示标准，RDF数据管理中广泛采用了关系数据库。为了对RDF数据进行SPARQL查询的有效评估，旧版查询优化器需要重新考虑。一个重要的问题是如何解决由于容易出错的基数估计而导致的次优查询计划。对于RDF数据，在实际评估的查询代价高昂甚至不可行之前，请确定最佳执行顺序。在本文中，我们提出了ROSIE，这是一个运行时优化框架，该框架根据从增量式局部查询评估得出的实际基数来迭代地重新优化SPARQL查询计划。通过引入一种基于启发式计划生成的方法，以及一种在运行时检测基数估计错误的机制，ROSIE可以有效地缓解基数有偏向传播的问题。在真实数据和基准数据上进行的大量实验表明，与最新技术相比，ROSIE在复杂查询上的性能始终比其高出几个数量级。

著录项

来源
《International conference on knowledge science, engineering and management》|2018年|117-131|共15页
会议地点
作者
Lei Gai; Xiaoming Wang; Tengjiao Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
SPARQL; RDF; Query optimization Cardinality estimation; Runtime optimization;

机译：SPARQL; RDF;查询优化基数估计;运行时优化;

相似文献

外文文献
中文文献
专利

1. BimSPARQL: Domain-specific functional SPARQL extensions for querying RDF building data [J] . Zhang Chi, Beetz Jakob, de Vries Bauke Semantic web . 2018,第6期

机译：bimsparql：用于查询RDF构建数据的域特定功能型SPARQL扩展
2. Keyword search over schema-less RDF datasets by SPARQL query compilation [J] . Izquierdo Yenier T., Garcia Grettel M., Menendez Elisa, Information Systems . 2021,第Deca期

机译：SparQL查询编译关键字在概要的RDF数据集中搜索
3. Completeness and soundness guarantees for conjunctive SPARQL queries over RDF data sources with completeness statements [J] . Darari Fariz, Nutt Werner, Razniewski Simon, Semantic web . 2020,第3期

机译：具有完整性陈述的RDF数据源对结合SPARQL查询的完整性和合理性保证
4. ROSIE: Runtime Optimization of SPARQL Queries over RDF Using Incremental Evaluation [C] . Lei Gai, Xiaoming Wang, Tengjiao Wang International Conference on Knowledge Science, Engineering and Management . 2018

机译：Rosie：使用增量评估，RDF对SPARQL查询的运行时优化
5. A new approach for fast processing of SPARQL queries on RDF quadruples [D] . Slavov, Vasil Georgiev 2015

机译：快速处理RDF四倍的SPARQL查询的新方法
6. SPANG: a SPARQL client supporting generation and reuse of queries for distributed RDF databases [O] . Hirokazu Chiba, Ikuo Uchiyama 2017

机译：SPANG：SPARQL客户端支持生成和重用分布式RDF数据库的查询
7. Evaluating SPARQL queries on massive RDF datasets [O] . Al-Harbi Razen, Abdelaziz Ibrahim, Kalnis Panos, 2015

机译：在大量RDF数据集上评估SPARQL查询

ROSIE: Runtime Optimization of SPARQL Queries over RDF Using Incremental Evaluation

摘要

著录项

相似文献

相关主题

期刊订阅