首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >Efficient Iceberg Query Evaluation Using Compressed Bitmap Index
【24h】

Efficient Iceberg Query Evaluation Using Compressed Bitmap Index

机译:使用压缩位图索引的高效Iceberg查询评估

获取原文
获取原文并翻译 | 示例
       

摘要

Decision support and knowledge discovery systems often compute aggregate values of interesting attributes by processing a huge amount of data in very large databases and/or warehouses. In particular, iceberg query is a special type of aggregation query that computes aggregate values above a user-provided threshold. Usually, only a small number of results will satisfy the threshold constraint. Yet, the results often carry very important and valuable business insights. Because of the small result set, iceberg queries offer many opportunities for deep query optimization. However, most existing iceberg query processing algorithms do not take advantage of the small-result-set property and rely heavily on the tuple-scan-based approach. This incurs intensive disk accesses and computation, resulting in long processing time especially when data size is large. Bitmap index, which builds one bitmap vector for each attribute value, is gaining popularity in both column-oriented and row-oriented databases in recent years. It occupies less space than the raw data and gives opportunities for more efficient query processing. In this paper, we exploited the property of bitmap index and developed a very effective bitmap pruning strategy for processing iceberg queries. Our index-pruning-based approach eliminates the need of scanning and processing the entire data set (table) and thus speeds up the iceberg query processing significantly. Experiments show that our approach is much more efficient than existing algorithms commonly used in row-oriented and column-oriented databases.
机译:决策支持和知识发现系统通常通过处理超大型数据库和/或仓库中的大量数据来计算有趣属性的合计值。特别地,iceberg查询是一种特殊类型的聚合查询,它计算高于用户提供的阈值的聚合值。通常,只有少数结果会满足阈值约束。但是,结果通常带有非常重要且有价值的业务见解。由于结果集很小,因此冰山查询为深度查询优化提供了许多机会。但是,大多数现有的冰山查询处理算法没有利用小结果集属性,而是严重依赖基于元组扫描的方法。这会导致密集的磁盘访问和计算,从而导致处理时间较长,尤其是在数据量较大时。近年来,位图索引为每个属性值构建一个位图矢量,在面向列和面向行的数据库中都越来越受欢迎。它比原始数据占用更少的空间,并为更有效的查询处理提供了机会。在本文中,我们利用位图索引的属性并开发了一种非常有效的位图修剪策略来处理冰山查询。我们基于索引修剪的方法消除了扫描和处理整个数据集(表)的需要,从而显着加快了冰山查询处理的速度。实验表明,我们的方法比面向行和面向列的数据库中常用的现有算法高效得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号