【24h】

Practical selectivity estimation through adaptive sampling

机译:通过自适应采样进行实际选择性估计

获取原文

摘要

Recently we have proposed an adaptive, random sampling algorithm for general query size estimation. In earlier work we analyzed the asymptotic efficiency and accuracy of the algorithm, in this paper we investigate its practicality as applied to selects and joins. First, we extend our previous analysis to provide significantly improved bounds on the amount of sampling necessary for a given level of accuracy. Next, we provide "sanity bounds" to deal with queries for which the underlying data is extremely skewed or the query result is very small. Finally, we report on the performance of the estimation algorithm as implemented in a host language on a commercial relational system. The results are encouraging, even with this loose coupling between the estimation algorithm and the DBMS.

机译:

最近,我们提出了一种用于一般查询大小估计的自适应随机抽样算法。在较早的工作中,我们分析了算法的渐近效率和准确性,在本文中,我们研究了该算法在选择和联接中的实用性。首先,我们扩展了先前的分析,以提供给定精度水平所需的采样量的显着改善的界限。接下来,我们提供“理智界限”来处理基础数据极度偏斜或查询结果非常小的查询。最后,我们报告了在商业关系系统上以宿主语言实现的估算算法的性能。即使估计算法与DBMS之间存在松散耦合,结果还是令人鼓舞的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号