【24h】

Fast Incremental Maintenance of Approximate Histograms

机译:快速增量维护近似直方图

获取原文
获取原文并翻译 | 示例

摘要

Many commercial database systems maintain histograms to summarize the contents of large relations and permit efficient estimation of query result sizes for use in query optimizers. Delaying the propagation of database updates to the histogram often introduces errors in the estimation. This paper presents new sampling-based approaches for incremental maintenance of approximate histograms. By scheduling updates to the histogram based on the updates to the database, our techniques are the first to maintain histograms effectively up-to-date at all times and avoid computing overheads when unnecessary. Our techniques provide highly-accurate approximate histograms belonging to the equi-depth and Compressed classes. Experimental results show that our new approaches provide orders of magnitude more accurate estimation than previous approaches.rnAn important aspect employed by these new approaches is a backing sample, an up-to-date random sample of the tuples currently in a relation. We provide efficient solutions for maintaining a uniformly random sample of a relation in the presence of updates to the relation. The backing sample techniques can be used for any other application that relies on random samples of data.
机译:许多商业数据库系统都维护直方图以总结大关系的内容,并允许有效地估计查询结果大小,以供查询优化器使用。延迟将数据库更新传播到直方图通常会在估计中引入错误。本文介绍了新的基于采样的方法,用于近似直方图的增量维护。通过基于对数据库的更新来调度对直方图的更新,我们的技术是第一个始终始终有效地保持直方图最新状态并在不必要时避免计算开销的技术。我们的技术提供了属于等深度和压缩类的高精度近似直方图。实验结果表明,我们的新方法比以前的方法提供了更精确的数量级。这些新方法采用的一个重要方面是支持样本,即当前关系中元组的最新随机样本。我们提供了有效的解决方案,用于在存在关系更新的情况下维护关系的均匀随机样本。支持样本技术可用于依赖于数据随机样本的任何其他应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号