首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Optimizing sort order query execution in balanced and nested grid files
【24h】

Optimizing sort order query execution in balanced and nested grid files

机译:优化平衡和嵌套网格文件中的排序顺序查询执行

获取原文
获取原文并翻译 | 示例

摘要

Disk input/output (I/O) efficient query execution is an important topic with respect to DBMS performance. In this context, we elaborate on the construction of disk access plans for sort order queries in balanced and nested grid files. The key idea is to use the order information contained in the directory of the multiattribute search structure. The presented algorithms are shown to yield a significant decrease in the number of disk I/O operations by appropriate use of the order information. Two algorithms for the construction of appropriate disk access plans are proposed, namely a greedy approach and a heuristic divide-and-conquer approach. Both approaches yield considerable I/O savings compared to straightforward query processing without consideration of any directory order information. The former performs well for small buffer page allocations, i.e., for a small number of buffer pages relative to the number of data buckets processed in the query. The latter is superior to the greedy algorithm with respect to the total number of I/O operations and with respect to the overall maximum of buffer pages needed to achieve the minimal number of disk I/O operations. Both approaches rely on a binary trie as a temporary data structure. This trie is used as an explicit representation of the order information. The storage consumption of the temporary data structure is shown to be negligible in realistic cases, Even for pathological cases with respect to degenerated balanced and nested grid files, reasonable upper bounds can be given.
机译:磁盘输入/输出(I / O)高效查询执行是有关DBMS性能的重要主题。在这种情况下,我们将详细介绍在平衡和嵌套的网格文件中排序顺序查询的磁盘访问计划的构建。关键思想是使用包含在多属性搜索结构目录中的订单信息。通过适当使用顺序信息,显示出的算法可显着减少磁盘I / O操作的数量。提出了两种构造适当的磁盘访问计划的算法,即贪婪法和启发式分治法。与不考虑任何目录顺序信息的直接查询处理相比,这两种方法都可节省大量I / O。对于较小的缓冲区页面分配,即对于相对于查询中处理的数据存储桶数量的少量缓冲区页面而言,前者表现良好。就I / O操作的总数以及实现最少磁盘I / O操作所需的缓冲区页面的总体最大值而言,后者优于贪心算法。两种方法都依赖二进制特里树作为临时数据结构。该特里用作订单信息的显式表示。在现实情况下,临时数据结构的存储消耗可忽略不计。即使对于退化的平衡和嵌套网格文件的病理情况,也可以给出合理的上限。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号