...
首页> 外文期刊>The VLDB journal >Histograms based on the minimum description length principle
【24h】

Histograms based on the minimum description length principle

机译:基于最小描述长度原理的直方图

获取原文
获取原文并翻译 | 示例
           

摘要

Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads.
机译:直方图已广泛用于查询优化中的选择性估计,以及许多OLAP,数据挖掘和数据可视化应用程序中的快速近似查询回答。本文基于最小描述长度原理,提出了一个新的直方图族,即层次模型拟合(HMF)直方图。 HMF直方图不是使用相同类型的模型描述直方图的每个存储桶,而是为每个存储桶采用局部最优模型。局部选择模型的提高的有效性比跟踪每个单独存储桶的表示的开销要大得多。通过一组实验,我们表明,对于各种查询工作负载中的许多实际和合成数据集,HMF直方图能够提供比以前提出的技术更准确的近似值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号