【24h】

A New Algorithm for Accurate Histogram Construction

机译:一种新的准确直方图构造算法

获取原文

摘要

Many commercial relational database systems use histograms to summarize data sets and also to determine the frequency distribution of attribute values. Based on this distribution, a database system estimates query result sizes within query optimization useful in effective information retrieval. Moreover, histograms are beneficial for judging whether the quality of the source is reliable or not; therefore, they enable us/one to decide whether to keep this source in the information retrieval or remove it. Each histogram contains commonly an error which affects the accuracy of the estimation. This work surveys the state of the art on the problem of identifying optimal histograms, studies the effectiveness of these optimal histograms in limiting error propagation in the context of query optimization, and proposes a new algorithm for accurate histogram construction. As a result, we can conclude that theoretical results are confirmed in practice. In fact, the proposed histogram generates a low error.
机译:许多商业关系数据库系统使用直方图来总结数据集,并确定属性值的频率分布。基于此分布,数据库系统估计在有效信息检索中有用的查询优化内的查询结果大小。此外,直方图有利于判断源的质量是否可靠;因此,它们使我们/人能够决定是否在信息检索或删除它的情况下保持此源。每个直方图通常包含影响估计准确性的错误。这项工作对识别最佳直方图的问题进行了解决问题的问题,研究了这些最佳直方图在查询优化背景下限制误差传播的有效性,并提出了一种用于精确直方图构造的新算法。结果,我们可以得出结论,理论结果在实践中得到证实。实际上,所提出的直方图产生低错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号