首页> 外文会议>International conference on very large data bases >Guided automated learning for query workload re-optimization
【24h】

Guided automated learning for query workload re-optimization

机译:引导式自动学习,可重新优化查询工作负载

获取原文

摘要

Query optimization is a hallmark of database systems. When an SQL query runs more expensively than is viable or warranted, determination of the performance issues is usually performed manually in consultation with experts through the analysis of query's execution plan (QEP). However, this is an excessively time consuming, human error-prone, and costly process. GALO is a novel system that automates this process. The tool automatically learns recurring problem patterns in query plans over workloads in an offline learning phase, to build a knowledge base of plan-rewrite remedies. It then uses the knowledge base online to re-optimize queries often quite drastically. GALO's knowledge base is built on RDF and SPARQL, W3C graph database standards, which is well suited for manipulating and querying over SQL query plans, which are graphs themselves. GALO acts as a third-tier of re-optimization, after query rewrite and cost-based optimization, as a query plan rewrite. For generality, the context of knowledge base problem patterns, including table and column names, is abstracted with canonical symbol labels. Since the knowledge base is not tied to the context of supplied QEPs, table and column names are matched automatically during the re-optimization phase. Thus, problem patterns learned over a particular query workload can be applied in other query workloads. GALO's knowledge base is also an invaluable tool for database experts to debug query performance issues by tracking to known issues and solutions as well as refining the optimizer with new tuned techniques by the development team. We demonstrate an experimental study of the effectiveness of our techniques over synthetic TPC-DS and real IBM client query workloads.
机译:查询优化是数据库系统的标志。当SQL查询的运行成本超出可行或保证的范围时,通常会通过分析查询的执行计划(QEP)与专家协商,手动确定性能问题。但是,这是一个非常耗时,易人为错误且成本很高的过程。 GALO是一个新颖的系统,可以自动执行此过程。该工具在脱机学习阶段自动通过工作负载学习查询计划中的重复出现的问题模式,以建立计划重写补救措施的知识库。然后,它会使用在线知识库来经常大幅度地重新优化查询。 GALO的知识库建立在RDF和SPARQL(W3C图形数据库标准)的基础上,非常适合对SQL查询计划(即图形本身)进行操作和查询。在重写查询和基于成本的优化之后,GALO充当重新优化的第三层,是重写查询计划。通常,知识库问题模式(包括表名和列名)的上下文都使用规范的符号标签进行了抽象。由于知识库不依赖于所提供的QEP的上下文,因此表和列的名称在重新优化阶段会自动匹配。因此,可以将通过特定查询工作负载学习的问题模式应用于其他查询工作负载。 GALO的知识库也是数据库专家通过跟踪已知问题和解决方案以及开发团队使用新的优化技术优化优化器来调试查询性能问题的宝贵工具。我们展示了一项实验研究,证明了我们的技术在合成TPC-DS和实际IBM客户端查询工作负载上的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号