【24h】

Bucket-Sorted Hash Join

机译:桶排序哈希加入

获取原文
获取原文并翻译 | 示例
           

摘要

As one of the most important operations in relational database management systems, the join operation is very time-consuming as it needs to merge related records between two tables to produce valuable data. Thus far, several join schemes have been proposed to improve the performance of the join operation, and the hybrid hash-join scheme generally shows the best performance among them. However, this scheme incurs a big overhead during the probing phase as it must scan all records across buckets in the hash table in order to find a corresponding record. In this study, we propose a new hash join scheme, called bucket-sorted hash join, which only maintains records sorted within a bucket. Our proposed scheme can significantly reduce the overhead incurred during the probing phase because all records are sorted within a bucket, and the corresponding records are easily found using a binary search. Our experiments show that the proposed scheme can improve the performance of the join operation by up to 300% in terms of the TPC-H benchmark compared to the hybrid hash join scheme. Thus, the proposed scheme is a viable alternative in hash join operations.
机译:作为关系数据库管理系统中最重要的操作之一,加入操作非常耗时,因为它需要合并两个表之间的相关记录以产生有价值的数据。到目前为止,已经提出了几种连接方案来改善加入操作的性能,并且混合散列连接方案通常显示它们之间的最佳性能。但是,此方案在探测阶段会引起大开销,因为它必须在哈希表中扫描桶中的所有记录才能找到相应的记录。在这项研究中,我们提出了一个新的哈希连接方案,称为桶排序哈希加入,只能维护在桶内排序的记录。我们所提出的方案可以显着降低探测阶段期间产生的开销,因为所有记录都在桶内排序,并且使用二进制搜索轻松找到相应的记录。我们的实验表明,与混合哈希连接方案相比,该方案可以将连接操作的性能提高到TPC-H基准,从TPC-H基准方案提高到300%。因此,所提出的方案是散列加入操作中可行的替代品。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号