International Conference on Very Large Data Bases

Hamming Filter: A Dynamic Signature File Organization for Parallel Stores

Abstract

Partitioning has become the basic strategy for organizing data files to avoid an exhaustive search when executing queries. However, hardware limitations that constrain query performance become a problem mainly for partial-match queries, where the size of the result can equal the size of the data file. In such situations, a proper application of parallelism can bring the required breakthrough in performance. Hamming Filter is a parallel, partitioned organization of signature files, stored in fixed-size buckets with a guaranteed load and based on the idea of linear code decomposition. It can efficiently manage dynamic data files by means of a partitioned structure that always grows and shrinks linearly, and it is suited to multidimensional partitioning and searching. This paper proves that the organization yields no expected execution skew for partial-match queries, provided the data is not skewed and the degree of parallelism is a power of two.
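
To make the terms in the abstract concrete, the following is a minimal sketch of how a generic signature file answers a partial-match query and how signatures might be spread over a power-of-two number of partitions. It is not the Hamming Filter algorithm: the abstract does not describe the linear code decomposition, so the partitioning here simply uses the low bits of the signature. The signature width `SIG_BITS`, the hash-based bit selection, and all function names are assumptions made for illustration.

```python
import hashlib

SIG_BITS = 64  # assumed signature width, chosen only for illustration


def word_signature(word: str, bits_per_word: int = 3) -> int:
    """Set a few hash-chosen bits for one word (superimposed coding)."""
    sig = 0
    for i in range(bits_per_word):
        digest = hashlib.sha256(f"{i}:{word}".encode()).digest()
        sig |= 1 << (int.from_bytes(digest[:4], "big") % SIG_BITS)
    return sig


def record_signature(words) -> int:
    """OR together the word signatures of a record (or of a query)."""
    sig = 0
    for w in words:
        sig |= word_signature(w)
    return sig


def may_match(record_sig: int, query_sig: int) -> bool:
    """A record can qualify only if it has every bit the query sets.
    False positives are possible; false negatives are not."""
    return (record_sig & query_sig) == query_sig


def partition_of(sig: int, num_partitions: int) -> int:
    """Hypothetical placement rule: low bits of the signature.
    Stands in for the paper's linear code decomposition, which is not shown."""
    assert num_partitions > 0 and num_partitions & (num_partitions - 1) == 0
    return sig & (num_partitions - 1)


def candidate_partitions(query_sig: int, num_partitions: int):
    """Partitions that can hold a matching record under the rule above."""
    required = query_sig & (num_partitions - 1)
    return [p for p in range(num_partitions) if (p & required) == required]
```

Under these assumptions, each candidate partition can be scanned independently, which is where parallelism enters; a query that sets none of the partitioning bits must visit every partition, matching the abstract's remark that a partial-match result can be as large as the data file.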
