Star-Cubing: Computing Iceberg Cubes by Top-Down and Bottom-Up Integration

机译：星形计算：通过自上而下和自下而上的集成计算冰山多维数据集

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data cube computation is one of the most essential but expensive operations in data warehousing. Previous studies have developed two major approaches, top-down vs. bottom-up. The former, represented by the Multi-Way Array Cube (called MultiWay) algorithm [25], aggregates simultaneously on multiple dimensions; however, it cannot take advantage of Apriori pruning [2] when computing iceberg cubes (cubes that contain only aggregate cells whose measure value satisfies a threshold, called iceberg condition). The latter, represented by two algorithms: BUC [6] and H-Cubing[11], computes the iceberg cube bottom-up and facilitates Apriori pruning. BUC explores fast sorting and partitioning techniques; whereas H-Cubing explores a data structure, H-Tree, for shared computation. However, none of them fully explores multi-dimensional simultaneous aggregation. In this paper, we present a new method, Star-Cubing, that integrates the strengths of the previous three algorithms and performs aggregations on multiple dimensions simultaneously. It utilizes a star-tree structure, extends the simultaneous aggregation methods, and enables the pruning of the group-by's that do not satisfy the iceberg condition. Our performance study shows that Star-Cubing is highly efficient and outperforms all the previous methods in almost all kinds of data distributions.

机译：数据多维数据集计算是数据仓库中最重要但最昂贵的操作之一。先前的研究开发了两种主要方法，即自上而下与自下而上。前者以多维数组多维数据集（称为MultiWay）算法[25]为代表，同时在多个维度上聚合。但是，在计算冰山多维数据集（仅包含度量值满足阈值的聚合单元的多维数据集，称为冰山条件）时，无法利用Apriori修剪[2]的优势。后者由两种算法表示：BUC [6]和H-Cubing [11]，用于计算冰山立方自下而上并促进Apriori修剪。 BUC探索快速排序和分区技术；而H-Cubing探索了用于共享计算的数据结构H-Tree。但是，它们都没有充分探讨多维同时聚合。在本文中，我们提出了一种新方法Star-Cubing，该方法整合了前三种算法的优势并同时在多个维度上进行聚合。它利用星形树结构，扩展了同时聚合方法，并可以对不满足冰山条件的分组依据进行修剪。我们的性能研究表明，Star-Cubing是高效的，并且在几乎所有类型的数据分布中都优于以前的所有方法。

著录项

来源
《Twenty-ninth International Conference on Very Large Databases; Sep 9-12, 2003; Berlin, Germany》|2003年|p.476-487|共12页
会议地点 Berlin(DE);Berlin(DE)
作者
Dong Xin; Jiawei Han; Xiaolei Li; Benjamin W. Wah;
展开▼
作者单位

University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
入库时间 2022-08-26 14:15:36

相似文献

外文文献
中文文献
专利

1. Computing Iceberg Cubes by Top-Down and Bottom-Up Integration: The StarCubing Approach [J] . Dong Xin, Jiawei Han, Xiaolei Li, IEEE Transactions on Knowledge and Data Engineering . 2007,第期

机译：通过自上而下和自下而上的集成计算冰山多维数据集：StarCubing方法
2. The Multi-Tree Cubing algorithm for computing iceberg cubes [J] . Xing Li, Howard J. Hamilton, Kamran Karimi, Journal of Intelligent Information Systems . 2009,第2期

机译：用于计算冰山立方体的Multi-Tree Cubing算法
3. A fully computable model of bottom-up and top-down processing in high-level visual cortex [J] . Kendrick Kay, Jason Yeatman Journal of vision . 2016,第12期

机译：在高级视觉皮层中自下而上和自上而下处理的完全可计算模型
4. Star-Cubing: Computing Iceberg Cubes by Top-Down and Bottom-Up Integration [C] . Dong Xin, Jiawei Han, Xiaolei Li, International conference on very large databases . 2003

机译：星团：通过自上而下和自下而上的集成计算冰山立方体
5. The multi-tree cubing algorithm for computing iceberg cubes. [D] . Li, Xing. 2005

机译：用于计算冰山多维数据集的多树多维数据集算法。
6. Expectation and attention increase the integration of top-down and bottom-up signals in perception through different pathways [O] . Noam Gordon, Naotsugu Tsuchiya, Roger Koenig-Robert, 2019

机译：期望和注意力会通过不同途径增强自上而下和自下而上的信号在感知中的整合
7. MM-Cubing: Computing Iceberg Cubes by Factorizing the Lattice Space [O] . Zheng Shao, Jiawei Han, Dong Xin 2004

机译：MM-Cubing：通过分解格空间来计算冰山立方体

Star-Cubing: Computing Iceberg Cubes by Top-Down and Bottom-Up Integration

摘要

著录项

相似文献

相关主题

期刊订阅