【24h】

Self-tuning UDF Cost Modeling Using the Memory-Limited Quadtree

机译:使用内存受限四叉树的自调整UDF成本建模

获取原文
获取原文并翻译 | 示例

摘要

Query optimizers in object-relational database management systems require users to provide the execution cost models of user-defined functions(UDFs). Despite this need, however, there has been little work done to provide such a model. Furthermore, none of the existing work is self-tuning and, therefore, cannot adapt to changing UDF execution patterns. This paper addresses this problem by introducing a self-tuning cost modeling approach based on the quadtree. The quadtree has the inherent desirable properties to (1) perform fast retrievals, (2) allow for fast incremental updates (without storing individual data points), and (3) store information at different resolutions. We take advantage of these properties of the quadtree and add the following in order to make the quadtree useful for UDF cost modeling: the abilities to (1) adapt to changing UDF execution patterns and (2) use limited memory. To this end, we have developed a novel technique we call the memory-limited quadtree(MLQ). In MLQ, each instance of UDF execution is mapped to a query point in a multi-dimensional space. Then, a prediction is made at the query point, and the actual value at the point is inserted as a new data point. The quadtree is then used to store summary information of the data points at different resolutions based on the distribution of the data points. This information is used to make predictions, guide the insertion of new data points, and guide the compression of the quadtree when the memory limit is reached. We have conducted extensive performance evaluations comparing MLQ with the existing (static) approach.
机译:对象关系数据库管理系统中的查询优化器要求用户提供用户定义函数(UDF)的执行成本模型。尽管有这种需求,但是提供这种模型的工作很少。此外,现有的工作都不是自调整的,因此不能适应不断变化的UDF执行模式。本文通过介绍一种基于四叉树的自调整成本建模方法来解决此问题。四叉树具有以下固有的期望属性:(1)执行快速检索,(2)允许快速增量更新(不存储单个数据点),以及(3)以不同的分辨率存储信息。我们利用四叉树的这些属性,并添加以下内容以使四叉树可用于UDF成本建模:(1)适应不断变化的UDF执行模式和(2)使用有限内存的能力。为此,我们开发了一种新颖的技术,称为内存受限四叉树(MLQ)。在MLQ中,UDF执行的每个实例都映射到多维空间中的查询点。然后,在查询点进行预测,并将该点的实际值插入为新的数据点。然后,使用四叉树基于数据点的分布以不同的分辨率存储数据点的摘要信息。当达到内存限制时,此信息用于进行预测,指导新数据点的插入以及指导四叉树的压缩。我们已经进行了广泛的性能评估,将MLQ与现有(静态)方法进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号