首页> 外文学位 >The impact of early grouping and user-defined functions on query optimization.
【24h】

The impact of early grouping and user-defined functions on query optimization.

机译:早期分组和用户定义的功能对查询优化的影响。

获取原文
获取原文并翻译 | 示例

摘要

On-line Analytical Processing (or OLAP) is a new class of query processing for large-scaled database systems. It provides a quick, responsive way for the users of Decision Support System (DSS) to navigate through the large amount of data in big organizations. To achieve the required performance, the frequently requested aggregate queries are precomputed (or materialized) and stored in a centralized repository, called Data Warehouse. Due to the large amount of data, the query optimizers must devise optimal plans for the computation of these materialized views to meet the user requirements.; Traditional two-phase optimization approach for aggregate queries, i.e., optimizing the query without considering the GROUP BY and aggregation, and appending the aggregation on the resulting plan in the former process, is not guaranteed to produce optimal plans. A new technique that evaluates the GROUP BY operators early in query optimization provides more opportunities for the optimizers to find the optimal plans. However, pushing down the GROUP BY operator also increase the search space dramatically. The first part of the thesis is to derive heuristics that will reduce the search space in a cost-based optimization.; The second part of the thesis extends the optimizer's ability to generate plans for queries with holistic aggregate functions, using the early grouping technique. One difficulty is that the evaluation of holistic functions cannot be started until all data are collected, which is not compatible with the early grouping technique. In the early grouping approach, data are evaluated by partitions and the results of the partitions are merged in the final stage. The thesis provides a method to start the evaluation by partially aggregating the input data even not all data are completely collected.; The third part of the thesis enhances the database system to allow users define their own grouping attributes. This contribution allows users to generate new information from the existing data. The new ability, however, increases the burden to the optimizer to find an optimal plan when the attributes involved are derived from some other attributes. The thesis provides a new evaluation method and a thorough cost analysis of the new model. It offers the optimizers new opportunities to generate more cost efficient plans.
机译:联机分析处理(OLAP)是用于大型数据库系统的一类新的查询处理。它为决策支持系统(DSS)的用户提供了一种快速响应的方式,可以浏览大型组织中的大量数据。为了获得所需的性能,通常会计算(或实现)经常请求的聚合查询并将其存储在称为“数据仓库”的集中存储库中。由于数据量很大,查询优化器必须为计算这些物化视图设计最佳计划,以满足用户需求。对于聚合查询的传统两阶段优化方法,即在不考虑GROUP BY和聚合的情况下优化查询,并不能在前一过程中将聚合附加到生成的计划上,无法保证产生最佳计划。一种在查询优化中尽早评估GROUP BY运算符的新技术为优化器找到最佳计划提供了更多机会。但是,按下GROUP BY运算符也会大大增加搜索空间。本文的第一部分是推导启发式算法,它将在基于成本的优化中减少搜索空间。论文的第二部分使用早期分组技术扩展了优化器生成具有全部聚合函数的查询计划的能力。一个困难是,在收集所有数据之前无法开始对整体功能进行评估,这与早期的分组技术不兼容。在早期分组方法中,按分区评估数据,并在最后阶段合并分区的结果。本文提供了一种通过部分汇总输入数据(即使不是所有数据都被完全收集)来开始评估的方法。论文的第三部分改进了数据库系统,允许用户定义自己的分组属性。这种贡献使用户可以从现有数据中生成新信息。但是,当所涉及的属性是从某些其他属性派生时,新功能增加了优化器寻找最佳计划的负担。本文提供了一种新的评估方法,并对新模型进行了全面的成本分析。它为优化人员提供了产生更具成本效益的计划的新机会。

著录项

  • 作者

    Chiou, Shao-Fong.;

  • 作者单位

    University of Massachusetts Lowell.;

  • 授予单位 University of Massachusetts Lowell.;
  • 学科 Computer Science.
  • 学位 D.Sc.
  • 年度 1999
  • 页码 120 p.
  • 总页数 120
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号