首页> 外文期刊>Information Systems >Emerging Cubes: Borders, size estimations and lossless reductions
【24h】

Emerging Cubes: Borders, size estimations and lossless reductions

机译:新兴立方体:边界,尺寸估算和无损缩减

获取原文
获取原文并翻译 | 示例
       

摘要

Discovering trend reversals between two data cubes provides users with a novel and interesting knowledge when the real world context fluctuates: What is new? Which trends appear or emerge? Which tendencies are immersing or disappear? With the concept of Emerging Cube, we capture such trend reversals by enforcing an emergence constraint. We resume the classical borders for the Emerging Cube and introduce a new one which optimizes both storage space and computation time, provides a simple characterization of the size of Emerging Cubes, as well as classification and cube navigation tools. We soundly state the connection between the classical and proposed borders by using cube transversals. Knowing the size of Emerging Cubes without computing them is of great interest in particular for adjusting at best the underlying emergence constraint. We address this issue by studying an upper bound and characterizing the exact size of Emerging Cubes. We propose two strategies for quickly estimate their size: one based on analytical estimation, without database access, and one based on probabilistic counting using the proposed borders as the input of the near-optimal algorithm HyperLogLoc. Due to the efficiency of the estimation algorithm various iterations can be performed to calibrate at best the emergence constraint. Moreover, we propose reduced and lossless representations of the Emerging Cube by using the concept of cube closure. Finally, we perform experiments for different data distributions in order to measure on one hand the size of the introduced condensed and concise representations and on the other hand the performance (accuracy and computation time) of the proposed estimation method.
机译:当现实环境发生波动时,发现两个数据立方体之间的趋势反转将为用户提供新颖而有趣的知识:什么是新东西?出现或出现哪些趋势?哪些趋势正在浸没或消失?通过“新兴多维数据集”的概念,我们通过强制出现出现约束来捕获这种趋势逆转。我们恢复了新兴多维数据集的经典边界,并引入了一种新方法,该方法可优化存储空间和计算时间,提供了新兴多维数据集大小的简单表征,以及分类和多维数据集导航工具。我们通过使用立方体横断面很好地陈述了经典边框和建议边框之间的联系。知道新兴多维数据集的大小而不进行计算非常有趣,特别是对于最多调整潜在的出现约束。我们通过研究上限并描述新兴多维数据集的确切大小来解决此问题。我们提出了两种策略来快速估计其大小:一种基于分析估计,无需数据库访问,另一种基于概率计数(使用提议的边界作为近似最优算法HyperLogLoc的输入)。由于估计算法的效率,可以执行各种迭代以最好地校准出现约束。此外,我们提出了使用多维数据集封闭的概念来简化和无损表示新兴多维数据集。最后,我们针对不同的数据分布进行实验,以便一方面测量引入的简明表示的大小,另一方面测量所提出的估计方法的性能(准确性和计算时间)。

著录项

  • 来源
    《Information Systems》 |2009年第6期|536-550|共15页
  • 作者单位

    Laboratoire d'Informatique Fondamentale de Marseille (LIF), Aix-Marseille Universite - CNRS Case 901, 163 Avenue de Lummy, 13288 Marseille Cedex 9, France;

    Laboratoire d'Informatique Fondamentale de Marseille (LIF), Aix-Marseille Universite - CNRS Case 901, 163 Avenue de Lummy, 13288 Marseille Cedex 9, France;

    Laboratoire d'Informatique Fondamentale de Marseille (LIF), Aix-Marseille Universite - CNRS Case 901, 163 Avenue de Lummy, 13288 Marseille Cedex 9, France;

    Laboratoire d'Informatique Fondamentale de Marseille (LIF), Aix-Marseille Universite - CNRS Case 901, 163 Avenue de Lummy, 13288 Marseille Cedex 9, France;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    O_(LAP) mining; data warehouse; data cube; trend analysis; cube size estimation; closure;

    机译:O_(LAP)开采;数据仓库;数据立方体趋势分析;立方体尺寸估计;关闭;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号