首页> 外文会议>International Conference on Very Large Data Bases >CURE for Cubes: Cubing Using a ROLAP Engine
【24h】

CURE for Cubes: Cubing Using a ROLAP Engine

机译:治疗立方体:使用ROLAP引擎的突击队员

获取原文

摘要

Data cube construction has been the focus of much research due to its importance in improving efficiency of OLAP. A significant fraction of this work has been on ROLAP techniques, which are based on relational technology. Existing ROLAP cubing solutions mainly focus on "flat" datasets, which do not include hierarchies in their dimensions. Nevertheless, the nature of hierarchies introduces several complications into cube construction, making existing techniques essentially inapplicable in a significant number of real-world applications. In particular, hierarchies raise three main challenges: (a) The number of nodes in a cube lattice increases dramatically and its shape is more involved. These require new forms of lattice traversal for efficient execution, (b) The number of unique values in the higher levels of a dimension hierarchy may be very small; hence, partitioning data into fragments that fit in memory and include all entries of a particular value may often be impossible. This requires new partitioning schemes, (c) The number of tuples that need to be materialized in the final cube increases dramatically. This requires new storage schemes that remove all forms of redundancy for efficient space utilization. In this paper, we propose CURE, a novel ROLAP cubing method that addresses these issues and constructs complete data cubes over very large datasets with arbitrary hierarchies. CURE contributes a novel lattice traversal scheme, an optimized partitioning method, and a suite of relational storage schemes for all forms of redundancy. We demonstrate the effectiveness of CURE through experiments on both real-world and synthetic datasets. Among the experimental results, we distinguish those that have made CURE the first ROLAP technique to complete the construction of the cube of the highest-density dataset in the APB-1 benchmark (12 GB). CURE was in fact quite efficient on this, showing great promise with respect to the potential of the technique overall.
机译:数据立方体建设是由于其在提高OLAP效率方面的重要性,这一点是巨大的研究。这项工作的大部分是基于关系技术的ROLAP技术。现有的ROLAP立方解决方案主要关注“扁平”数据集,其不包括其尺寸的层次结构。然而,层次结构的性质引入了几个并发症的立方体建设,使现有技术在大量现实世界应用中基本上不适用。特别是,层次结构提高了三个主要挑战:(a)立方体格子中的节点数量显着增加,其形状更涉及。这些需要新的格子遍历遍历,以便有效执行,(b)较高级别层次结构中的唯一值的数量可能非常小;因此,将数据分成适合存储器的片段,并且包括特定值的所有条目通常是不可能的。这需要新的划分方案,(c)在最终立方体中需要物质化的元组的数量急剧增加。这需要新的存储方案,以删除所有形式的冗余以实现有效的空间利用率。在本文中,我们提出了一种解决这些问题的新型ROLAP CUBICE方法,并在具有任意层次结构的非常大的数据集中构建完整的数据多维数据集。 Cure为所有形式的冗余提供了一种新颖的晶格遍历方案,优化的分区方法,以及一套关系存储方案。我们通过实际和合成数据集的实验证明了治愈的有效性。在实验结果中,我们区分了解决了第一个ROLAP技术的那些,以完成APB-1基准(12 GB)中最高密度数据集的立方体的结构。实际上,治愈对此非常有效,对整体技术的潜力表现出很大的希望。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号