首页> 外文会议>ACM SIGMOD international conference on management of data;SIGMOD 2010 >Logging Every Footstep: Quantile Summaries for the Entire History
【24h】

Logging Every Footstep: Quantile Summaries for the Entire History

机译:记录每一个脚步:整个历史的分位数摘要

获取原文

摘要

Quantiles are a crucial type of order statistics in databases. Extensive research has been focused on maintaining a space-efficient structure for approximate quantile computation as the underlying dataset is updated. The existing solutions, however, are designed to support only the current, most-updated, snapshot of the dataset. Queries on the past versions of the data cannot be answered.This paper studies the problem of historical quantile search. The objective is to enable e-approximate quantile retrieval on any snapshot of the dataset in history. The problem is very important in analyzing the evolution of a distribution, monitoring the quality of services, query optimization in temporal databases, and so on. We present the first formal results in the literature. First, we prove a novel theoretical lower bound on the space cost of supporting e-approximate historical quantile queries. The bound reveals the fundamental difference between answering quantile queries about the past and those about the present time. Second, we propose a structure for finding e-approximate historical quantiles, and show that it consumes more space than the lower bound by only a square-logarithmic factor. Extensive experiments demonstrate that in practice our technique performs much better than predicted by theory. In particular, the quantiles it returns are remarkably more accurate than the theoretical precision guarantee.
机译:分位数是数据库中订单统计的重要类型。随着基础数据集的更新,广泛的研究集中在为近似分位数计算维护空间高效的结构上。但是,现有解决方案仅设计为仅支持数据集的最新,最新快照。对过去数据版本的查询无法回答。 本文研究了历史分位数搜索的问题。目的是在历史数据集的任何快照上启用电子近似分位数检索。该问题在分析分布的演变,监视服务质量,在时间数据库中进行查询优化等方面非常重要。我们介绍了文献中的第一个正式结果。首先,我们证明了支持电子近似历史分位数查询的空间成本的新颖理论下界。该界限揭示了回答有关过去的分位数查询和有关当前的分位数查询之间的根本区别。其次,我们提出了一种用于查找电子近似历史分位数的结构,并表明它所消耗的空间比仅用平方对数因子的下限要大。大量的实验表明,在实践中,我们的技术比理论预测的要好得多。特别是,它返回的分位数比理论上的精度保证要精确得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号