首页> 外文会议>Data Warehousing and Knowledge Discovery; Lecture Notes in Computer Science; 4081 >An Effective Algorithm to Extract Dense Sub-cubes from a Large Sparse Cube
【24h】

An Effective Algorithm to Extract Dense Sub-cubes from a Large Sparse Cube

机译:从大型稀疏立方体中提取密集子立方体的有效算法

获取原文
获取原文并翻译 | 示例

摘要

A data cube provides aggregate information to support a class of queries such as a range-sum query. To process those queries efficiently, some auxiliary information, i.e. prefix sums, is pre-computed and maintained. In reality however most of high dimensional data cubes are very sparse, causing a serious space overhead. In this paper, we investigate an algorithm that extracts dense sub-cubes from a large sparse cube based on the density function. Instead of maintaining a large prefix-sum cube, a few dense sub-cubes are maintained to reduce the space overhead and to restrict the update propagation. We present an iterative method that identifies dense intervals in each dimension and constructs sub-cubes based on the intervals found. We show the effectiveness of our method through the analytic comparison and experiment with respect to various data sets and dimensions.
机译:数据多维数据集提供聚合信息以支持一类查询,例如范围和查询。为了有效地处理那些查询,一些辅助信息,即前缀和,被预先计算和维护。然而实际上,大多数高维数据立方体非常稀疏,从而导致严重的空间开销。在本文中,我们研究了一种基于密度函数从大型稀疏立方体中提取密集子立方体的算法。代替维护大的前缀和多维数据集,而是维护几个密集的子多维数据集以减少空间开销并限制更新传播。我们提出了一种迭代方法,该方法可以识别每个维度中的密集间隔并根据找到的间隔构造子立方体。通过对各种数据集和维度进行分析比较和实验,我们证明了我们方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号