首页> 外文会议>ACM SIGMOD International Conference on Management of Data >Self-tuning Histograms: Building Histograms Without Looking at Data
【24h】

Self-tuning Histograms: Building Histograms Without Looking at Data

机译:自我调整直方图:构建直方图而不查看数据

获取原文

摘要

In this paper, we introduce self-tuning histograms. Although similar in structure to traditional histograms, these histograms infer data distributions not by examining the data or a sample thereof, but by using feedback from the query execution engine about the actual selectivity of range selection operators to progressively refine the histogram. Since the cost of building and maintaining self-tuning histograms is independent of the data size, self-tuning histograms provide a remarkably inexpensive way to construct histograms for large data sets with little up-front costs. Self-tuning histograms are particularly attractive as an alternative to multi-dimensional traditional histograms that capture dependencies between attributes but are prohibitively expensive to build and maintain. In this paper, we describe the techniques for initializing and refining self-tuning histograms. Our experimental results show that self-tuning histograms provide a low-cost alternative to traditional multi-dimensional histograms with little loss of accuracy for data distributions with low to moderate skew.
机译:在本文中,我们介绍了自我调整直方图。尽管在结构上与传统直方图类似,但是这些直方图通过检查数据或其样本来推断数据分布,而是通过从查询执行引擎中使用反馈,围绕范围选择运算符的实际选择性来逐步细化直方图。由于构建和维护自调整直方图的成本与数据大小无关,自调谐直方图提供了一种非常便宜的方式来构建大数据集的直方图,具有较少的预付成本。自我调整直方图是替代对多维传统直方图的替代方案,可以捕获属性之间的依赖性,但是构建和维护的昂贵昂贵。在本文中,我们描述了初始化和精炼自调整直方图的技术。我们的实验结果表明,自调整直方图为传统的多维直方图提供了低成本的替代方案,对于具有低到中等歪斜的数据分布几乎没有损失。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号