Progressive Query Optimization for Federated Queries

机译：联合查询的渐进式查询优化

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Database Management Systems (DBMS) perform query plan selection by mathematically modeling the execution cost of candidate execution plans and choosing the cheapest query execution plan (QEP) according to that cost model. The cost model requires accurate estimates of the sizes of intermediate results of all steps in the QEP. Outdated or incomplete statistics, parameter markers and complex skewed data frequently cause the selection of a suboptimal query plan, which in turn results in bad query performance. Federated queries are regular relational queries accessing data on one or more remote relational or non-relational data sources, possibly combining them with tables stored in the federated DBMS server. Their execution is typically divided between the federated server and the remote data sources. Outdated and incomplete statistics have a bigger impact on federated DBMS than on regular DBMS, as maintenance of federated statistics is unequally more complicated and expensive than the maintenance of the local statistics; consequently bad performance commonly occurs for federated queries due to the selection of a suboptimal query plan. We present an extension of the mid-query reoptimiza-tion technique "Progressive Query Optimization" (POP), which adds robustness to query processing by dynamically detecting if an access plan is suboptimal and by triggering a reoptimization in that case. Our extensions enable efficient reoptimization of federated queries. Our contributions are (a) an opportunistic, but risk controlled, reoptimization technique for federated DBMS (b) a technique for multiple reoptimizations during federated query processing, with a strategy to discover redundant and eliminate partial results and (c) a mechanism to eagerly procure statistics in a federated environment. We have implemented these techniques in a prototype version of WebSphere Information Integrator for DB2. Our enhancements enable robust and acceptable performance for federated queries, even if the remote data sources provided almost no statistical information about the data. An extensive case study on real world data shows POP has negligible runtime overhead and improves the performance of complex federated queries by up to a full order of magnitude.

机译：数据库管理系统（DBMS）通过对候选执行计划的执行成本进行数学建模并根据该成本模型选择最便宜的查询执行计划（QEP）来执行查询计划选择。成本模型要求准确评估QEP中所有步骤的中间结果的大小。统计信息过时或不完整，参数标记和复杂的偏斜数据经常导致选择次优查询计划，从而导致查询性能下降。联合查询是访问一个或多个远程关系或非关系数据源上的数据的常规关系查询，可能将它们与存储在联合DBMS服务器中的表进行组合。它们的执行通常在联合服务器和远程数据源之间分配。过时和不完整的统计信息对联邦DBMS的影响比对常规DBMS的影响要大，这是因为维护联邦统计信息比维护本地统计信息更为复杂和昂贵。因此，由于选择了次优的查询计划，联合查询的性能通常很差。我们提出了中间查询重新优化技术“渐进查询优化”（POP）的扩展，该技术通过动态检测访问计划是否欠佳并在这种情况下触发重新优化，为查询处理增加了鲁棒性。我们的扩展可以有效地优化联合查询。我们的贡献是（a）联合DBMS的机会主义但受风险控制的重新优化技术（b）联合查询处理期间的多次重新优化技术，以及发现冗余并消除部分结果的策略，以及（c）急于采购的机制联合环境中的统计信息。我们已经在WebSphere Information Integrator for DB2的原型版本中实现了这些技术。即使远程数据源几乎不提供有关数据的统计信息，我们的增强功能也可以为联合查询提供鲁棒且可接受的性能。大量有关现实世界数据的案例研究表明，POP的运行时开销可以忽略不计，并将复杂的联合查询的性能提高了一个完整的数量级。

著录项

来源
《International Conference on Extending Database Technology(EDBT 2006); 20060326-31; Munich(DE)》|2006年|P.847-864|共18页
会议地点 Munich(DE)
作者
Stephan Ewen; Holger Kache; Volker Markl; Vijayshankar Raman;
展开▼
作者单位

IBM Germany, Am Fichtenberg 1, 71083 Herrenberg, Germany;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP311.13;
关键词
入库时间 2022-08-26 14:11:47

相似文献

外文文献
中文文献
专利

1. Optimized ontology-driven query expansion using map-reduce framework to facilitate federated queries [J] . Neda Alipanah, Latifur Khan, Bhavani Thurisingham International Journal of Computer Systems Science & Engineering . 2012,第2期

机译：使用map-reduce框架优化了本体驱动的查询扩展，以促进联合查询
2. Optimizing Multi-Query Evaluation in Federated RDF Systems [J] . Peng Peng, Ge Qi, Zou Lei, IEEE Transactions on Knowledge and Data Engineering . 2021,第4期

机译：优化联合RDF系统中的多查询评估
3. A Study On Query Optimization for Federated Database Systems [J] . Xinhua Xu Computer and Information Science . 2009,第1期

机译：联合数据库系统查询优化研究
4. POP/FED: Progressive Query Optimization for Federated Queries in DB2 [C] . Holger Kache, Wook-Shin Han, Volker Markl, 32nd International Conference on Very Large Data Bases(VLDB 2006) vol.2 . 2006

机译：POP / FED：DB2中联合查询的渐进式查询优化
5. Beyond relational: A database architecture and federated query optimization in a multi-modal healthcare environment [D] . Hylock, Ray Hales 2013

机译：超越关系：多模式医疗环境中的数据库架构和联合查询优化
6. BioFed: federated query processing over life sciences linked open data [O] . Ali Hasnain, Qaiser Mehmood, Syeda Sana e Zainab, 2017

机译：BioFed：生命科学上的联合查询处理链接了开放数据
7. Scalable Multi-Query Optimization for Exploratory Queries over Federated Scientific Databases [O] . Kementsietsidis, Anastasios, Neven, Frank, Van de Craen, Dieter, 2008

机译：联邦科学数据库中的探索性查询的可扩展多查询优化

Progressive Query Optimization for Federated Queries

摘要

著录项

相似文献

相关主题

期刊订阅