首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Efficient Computation of Iceberg Cubes by Bounding Aggregate Functions
【24h】

Efficient Computation of Iceberg Cubes by Bounding Aggregate Functions

机译:有界聚合函数对冰山立方体的高效计算

获取原文
获取原文并翻译 | 示例
           

摘要

The iceberg cubing problem is to compute the multidimensional group-by partitions that satisfy given aggregation constraints. Pruning unproductive computation for iceberg cubing when nonantimonotone constraints are present is a great challenge because the aggregate functions do not increase or decrease monotonically along the subset relationship between partitions. In this paper, we propose a novel bound prune cubing (BP-Cubing) approach for iceberg cubing with nonantimonotone aggregation constraints. Given a cube over n dimensions, an aggregate for any group-by partition can be computed from aggregates for the most specific n--dimensional partitions (MSPs). The largest and smallest aggregate values computed this way become the bounds for all partitions in the cube. We provide efficient methods to compute tight bounds for base aggregate functions and, more interestingly, arithmetic expressions thereof, from bounds of aggregates over the MSPs. Our methods produce tighter bounds than those obtained by previous approaches. We present iceberg cubing algorithms that combine bounding with efficient aggregation strategies. Our experiments on real-world and artificial benchmark data sets demonstrate that BP-Cubing algorithms achieve more effective pruning and are several times faster than state-of-the-art iceberg cubing algorithms and that BP-Cubing achieves the best performance with the top-down cubing approach.
机译:冰山线索问题是计算满足给定聚合约束的多维分组依据分区。当存在非反单调约束时,对冰山多维数据集修剪非生产性计算是一个巨大的挑战,因为聚合函数不会沿着分区之间的子集关系单调增加或减少。在本文中,我们提出了一种新颖的约束修剪(BP-Cubing)方法,用于非反单调聚集约束下的冰山冻结。给定n个维度上的多维数据集,可以从最特定的n维分区(MSP)的聚合中计算出任何分组依据分区的聚合。以这种方式计算的最大和最小聚合值成为多维数据集中所有分区的边界。我们提供了有效的方法来计算基本聚合函数的紧密边界,更有趣的是,从MSP上的聚合边界计算其算术表达式。我们的方法产生的边界比以前的方法更严格。我们提出了结合了边界与有效聚合策略的冰山求知算法。我们在现实世界和人工基准数据集上的实验表明,BP-Cubing算法可实现更有效的修剪,并且比最新的冰山cubing算法快几倍,并且BP-Cubing在性能最高的情况下可获得最佳性能。向下探查方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号