【24h】

Mining Multi-Dimensional Constrained Gradients in Data Cubes

机译:在数据立方体中挖掘多维约束梯度

获取原文
获取原文并翻译 | 示例

摘要

Constrained gradient analysis (similar to the "cubegrade" problem posed by Imielinski, et al. [9]) is to extract pairs of similar cell characteristics associated with big changes in measure in a data cube. Cells are considered similar if they are related by roll-up, drill-down, or 1-dimensional mutation operation. Constrained gradient queries are expressive, capable of capturing trends in data and answering "what-if" questions. To facilitate our discussion, we call one cell in a gradient pair probe cell and the other gradient cell. An efficient algorithm is developed, which pushes constraints deep into the computation process, finding all gradient-probe cell pairs in one pass. It explores bi-directional pruning between probe cells and gradient cells, utilizing transformed measures and dimensions. Moreover, it adopts a hyper-tree structure and an H-cubing method to compress data and maximize sharing of computation. Our performance study shows that this algorithm is efficient and scalable.
机译:约束梯度分析(类似于Imielinski等人[9]提出的“多维数据集转换”问题)是提取与数据多维数据集中的度量值大变化相关的相似单元格特征对。如果单元格通过上滚,下钻或一维突变操作关联,则认为它们相似。约束梯度查询具有表现力,能够捕获数据趋势并回答“假设”问题。为便于讨论,我们将一个单元称为梯度对探针单元,将另一个单元称为梯度单元。开发了一种有效的算法,它将约束推入计算过程的深处,一次即可找到所有梯度探针单元对。它利用转换后的度量和尺寸来探索探针细胞和梯度细胞之间的双向修剪。而且,它采用超树结构和H-cubing方法来压缩数据并最大程度地共享计算。我们的性能研究表明,该算法高效且可扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号