...
首页> 外文期刊>Distributed and Parallel Databases >PnP: sequential, external memory, and parallel iceberg cube computation
【24h】

PnP: sequential, external memory, and parallel iceberg cube computation

机译:PnP:顺序,外部存储器和并行冰山多维数据集计算

获取原文
获取原文并翻译 | 示例
           

摘要

We present "Pipe 'n Prune" (PnP), a new hybrid method for iceberg-cube query computation. The novelty of our method is that it achieves a tight integration of top-down piping for data aggregation with bottom-up a priori data pruning. A particular strength of PnP is that it is efficient for all of the following scenarios: (1) Sequential iceberg-cube queries, (2) External memory iceberg-cube queries, and (3) Parallel iceberg-cube queries on shared-nothing PC clusters with multiple disks. We performed an extensive performance analysis of PnP for the above scenarios with the following main results: In the first scenario PnP performs very well for both dense and sparse data sets, providing an interesting alternative to BUC and Star-Cubing. In the second scenario PnP shows a surprisingly efficient handling of disk I/O, with an external memory running time that is less than twice the running time for full in-memory computation of the same iceberg-cube query. In the third scenario PnP scales very well, providing near linear speedup for a larger number of processors
机译:我们提出“ Pipe'n Prune”(PnP),一种用于冰山多维数据集查询计算的新混合方法。我们方法的新颖之处在于,它实现了自上而下的数据聚合管道与自下而上的先验数据修剪紧密集成。 PnP的一个特别优势是,它对于以下所有情况均有效:(1)顺序冰山多维数据集查询,(2)外部存储器冰山多维数据集查询,以及(3)无共享PC上的并行冰山多维数据集查询具有多个磁盘的群集。我们对上述场景的PnP进行了广泛的性能分析,得到以下主要结果:在第一个场景中,PnP在密集和稀疏数据集上均表现出色,为BUC和Star-Cubing提供了一种有趣的替代方法。在第二种情况下,PnP显示出令人惊讶的磁盘I / O处理效率,外部存储器运行时间少于相同冰山多维数据集查询的完整内存计算运行时间的两倍。在第三种情况下,PnP可很好地扩展,为大量处理器提供近乎线性的加速

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号