【24h】

Pipelining in Multi-Query Optimization

机译:多查询优化中的流水线

获取原文
获取原文并翻译 | 示例

摘要

Database systems frequently have to execute a set of related queries, which share several common subexpressions. Multi-query optimization exploits this, by finding evaluation plans that share common results. Current approaches to multi-query optimization assume that common subexpressions are materialized. Significant performance benefits can be had if common subexpressions are pipelined to their uses, without being materialized. However, plans with pipelining may not always be realizable with limited buffer space, as we show. We present a general model for schedules with pipelining, and present a necessary and sufficient condition for determining validity of a schedule under our model. We show that finding a valid schedule with minimum cost is NP-hard. We present a greedy heuristic for finding good schedules. Finally, we present a performance study that shows the benefit of our algorithms on batches of queries from the TPCD benchmark.
机译:数据库系统经常必须执行一组相关的查询,这些查询共享几个常见的子表达式。多查询优化通过找到共享共同结果的评估计划来利用这一点。当前的多查询优化方法假设实现了通用子表达式。如果将通用子表达式传递给它们的使用而没有实现,则可以显着提高性能。但是,如流水线所示,并非总是可以通过有限的缓冲区空间来实现计划。我们提出了带有流水线的进度表的通用模型,并提出了在我们的模型下确定进度表有效性的必要和充分条件。我们表明,以最小的成本找到有效的时间表是NP难的。我们提出了一个贪婪的启发式方法,以寻找合适的时间表。最后,我们进行了一项性能研究,显示了我们的算法对TPCD基准测试中的大量查询的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号