首页> 外文会议>International Conference on Data Warehousing and Knowledge Discovery(DaWaK 2006) >An Effective Algorithm to Extract Dense Sub-cubes from a Large Sparse Cube
【24h】

An Effective Algorithm to Extract Dense Sub-cubes from a Large Sparse Cube

机译:从大稀疏多维数据集中提取密集子立方体的有效算法

获取原文

摘要

A data cube provides aggregate information to support a class of queries such as a range-sum query. To process those queries efficiently, some auxiliary information, i.e. prefix sums, is pre-computed and maintained. In reality however most of high dimensional data cubes are very sparse, causing a serious space overhead. In this paper, we investigate an algorithm that extracts dense sub-cubes from a large sparse cube based on the density function. Instead of maintaining a large prefix-sum cube, a few dense sub-cubes are maintained to reduce the space overhead and to restrict the update propagation. We present an iterative method that identifies dense intervals in each dimension and constructs sub-cubes based on the intervals found. We show the effectiveness of our method through the analytic comparison and experiment with respect to various data sets and dimensions.
机译:数据多维数据集提供聚合信息,以支持一类查询,例如范围和查询。为了有效地处理这些查询,将预先计算和维护一些辅助信息,即前缀和。然而,实际上,大多数高维数据立方体都非常稀疏,导致严重的空间开销。在本文中,我们研究了一种算法,其基于密度函数从大稀疏多维数据集中提取密集的子立方体。而不是维护大的前缀 - 和多维数据集,因此保持了几个密集的子立方体以减少空间开销并限制更新传播。我们介绍了一种迭代方法,该方法识别每个维度中的密集间隔,并根据找到的间隔构建子立方体。我们通过对各种数据集和尺寸的分析比较和实验来展示我们方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号