A Skew-Insensitive Hashing Sync and Construction Scheme for Many-Core Coprocessors

Abstract

Many-core coprocessors are widely used in high-performance computing because of their highly parallel computing capabilities and superior energy efficiency. Hash tables are among the most widely used and important data structures in data-intensive applications, especially main-memory database systems. Because building a hash table is write-intensive, the biggest challenge in a high-concurrency environment is multithread synchronization, especially when workloads are highly skewed. To overcome this problem, we propose an efficient parallel hash table construction scheme that comprises a SIMD-aware data structure and a novel hashing synchronization algorithm named Pipelined Hashing Sync (PHS). PHS runs in two phases. In the first phase, the hash values are partitioned into batches of similar size. In the second phase, threads process the tuples assigned to them in parallel, and a token-ring scheme prevents multiple threads from concurrently processing tuples from the same batch. Experimental results show that, on uniform datasets, the proposed approach performs similarly to lock-based and CAS-based hashing algorithms. On highly skewed datasets, however, it improves performance by an order of magnitude, which indicates that the approach is more resilient to skewed workloads.
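
The two-phase idea can be illustrated with a small Python sketch. This is only one plausible reading of the abstract, not the authors' PHS implementation: the function name `build_hash_table`, the parameters `num_threads` and `num_buckets`, the choice of one batch per thread, the even split of the bucket range into batches, and the barrier between rotation steps are all assumptions made for illustration. In particular, the sketch does not reproduce the size balancing of batches or the SIMD-aware layout that the abstract mentions.

```python
import threading

def build_hash_table(tuples, num_threads=4, num_buckets=64):
    """Hypothetical two-phase build: stage tuples by batch, then rotate batch tokens."""
    num_batches = num_threads  # assumption: one batch (token) per thread
    table = [[] for _ in range(num_buckets)]
    # Each thread gets a strided slice of the input.
    chunks = [tuples[t::num_threads] for t in range(num_threads)]
    # staged[t][b] holds the tuples of thread t whose bucket falls in batch b.
    staged = [[[] for _ in range(num_batches)] for _ in range(num_threads)]
    barrier = threading.Barrier(num_threads)

    def batch_of(bucket):
        # Assumption: batches are contiguous, evenly sized ranges of buckets,
        # so two different batches never share a bucket.
        return bucket * num_batches // num_buckets

    def worker(t):
        # Phase 1: hash each tuple and stage it in a thread-private batch list.
        for key, value in chunks[t]:
            bucket = hash(key) % num_buckets
            staged[t][batch_of(bucket)].append((bucket, key, value))
        # Phase 2: at step s, thread t holds the token for batch (t + s) mod T.
        # The T tokens are distinct at every step, so no two threads write to
        # the same batch (and hence the same bucket) at the same time.
        for step in range(num_batches):
            batch = (t + step) % num_batches
            for bucket, key, value in staged[t][batch]:
                table[bucket].append((key, value))
            barrier.wait()  # pass every token to the next thread in the ring

    threads = [threading.Thread(target=worker, args=(t,)) for t in range(num_threads)]
    for th in threads:
        th.start()
    for th in threads:
        th.join()
    return table
```

Because a hot key is spread across every thread's input slice, each thread inserts its own share of that key during the step in which it holds the corresponding token, and no bucket lock or CAS retry loop is involved. The barrier is the simplest way to keep the rotation in step; a real implementation would likely pass tokens without a global barrier.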