ACM SIGMOD International Conference on Management of Data

Join synopses for approximate query answering


Abstract

In large data warehousing environments, it is often advantageous to provide fast, approximate answers to complex aggregate queries based on statistical summaries of the full data. In this paper, we demonstrate the difficulty of providing good approximate answers for join queries using only statistics (in particular, samples) from the base relations. We propose join synopses as an effective solution for this problem and show how precomputing just one join synopsis for each relation suffices to significantly improve the quality of approximate answers for arbitrary queries with foreign key joins. We present optimal strategies for allocating the available space among the various join synopses when the query workload is known and identify heuristics for the common case when the workload is not known. We also present efficient algorithms for incrementally maintaining join synopses in the presence of updates to the base relations. Our extensive set of experiments on the TPC-D benchmark database shows the effectiveness of join synopses and various other techniques proposed in this paper.
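
The contrast drawn in the abstract can be illustrated with a small sketch. It is not the paper's implementation. It assumes that a join synopsis for a relation is a uniform sample of that relation, joined through its foreign keys with the rows it references, and that a SUM is estimated by scaling the sample sum by N/n. The toy relations (`orders`, `lineitems`), their columns, and all function names are hypothetical.

```python
import random

def build_join_synopsis(fact_rows, dim_by_key, fk, sample_size, rng):
    """Uniformly sample the referencing relation, then join every sampled row
    with the single row it references through the foreign key `fk`."""
    sample = rng.sample(fact_rows, sample_size)
    return [{**row, **dim_by_key[row[fk]]} for row in sample]

def join_of_samples(fact_rows, dim_rows, fk, pk, n_fact, n_dim, rng):
    """Baseline: sample each base relation independently, then join the samples.
    Most sampled fact rows find no partner, so few joined rows survive."""
    fact_sample = rng.sample(fact_rows, n_fact)
    dim_sample = {d[pk]: d for d in rng.sample(dim_rows, n_dim)}
    return [{**f, **dim_sample[f[fk]]} for f in fact_sample if f[fk] in dim_sample]

def estimate_sum(synopsis, total_rows, value, predicate):
    """Estimate SUM(value) over the full join by scaling the sample sum by N/n."""
    scale = total_rows / len(synopsis)
    return scale * sum(r[value] for r in synopsis if predicate(r))

# Hypothetical toy data: every line item references exactly one order.
rng = random.Random(0)
orders = [{"o_id": i, "region": rng.choice(["EU", "US", "ASIA"])}
          for i in range(1000)]
lineitems = [{"l_id": i, "o_id": rng.randrange(1000), "price": rng.randint(1, 100)}
             for i in range(10000)]
orders_by_id = {o["o_id"]: o for o in orders}

def is_eu(row):
    return row["region"] == "EU"

# 1% join synopsis of lineitems versus a join of independent 1% samples.
synopsis = build_join_synopsis(lineitems, orders_by_id, "o_id", 100, rng)
baseline = join_of_samples(lineitems, orders, "o_id", "o_id", 100, 10, rng)

exact = sum(l["price"] for l in lineitems if is_eu(orders_by_id[l["o_id"]]))
approx = estimate_sum(synopsis, len(lineitems), "price", is_eu)

print("join synopsis rows:  ", len(synopsis))
print("join-of-samples rows:", len(baseline))
print("exact SUM:", exact, " estimated SUM:", round(approx))
```

Because each line item matches exactly one order, the synopsis is itself a uniform sample of the join result and always contains as many rows as were sampled. The join of independent samples keeps only those sampled line items whose order also happened to be sampled, which is a small fraction in expectation.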

