【24h】

SIMD-Based Decoding of Posting Lists

机译:基于SIMD的发布列表解码

获取原文
获取外文期刊封面目录资料

摘要

Powerful SIMD instructions in modern processors offer an opportunity for greater search performance. In this paper, we apply these instructions to decoding search engine posting lists. We start by exploring variable-length integer encoding formats used to represent postings. We define two properties, byte-oriented and byte-preserving, that characterize many formats of interest. Based on their common structure, we define a taxonomy that classifies encodings along three dimensions, representing the way in which data bits are stored and additional bits are used to describe the data. Using this taxonomy, we discover new encoding formats, some of which are particularly amenable to SIMD-based decoding. We present generic SIMD algorithms for decoding these formats. We also extend these algorithms to the most common traditional encoding format. Our experiments demonstrate that SIMD-based decoding algorithms are up to 3 times faster than non-SIMD algorithms.
机译:现代处理器中强大的SIMD说明提供了更大的搜索性能的机会。在本文中,我们将这些说明应用于解码搜索引擎发布列表。我们首先探索用于表示发布的可变长度整数编码格式。我们定义了两个属性,面向字节和字节保留,其表征了许多感兴趣的格式。基于它们的共同结构,我们定义了一种分类,其沿三个维度对编码进行分类,代表存储数据比特的方式,并且使用附加位来描述数据。使用此分类法,我们发现新的编码格式,其中一些尤其适用于基于SIMD的解码。我们提供通用SIMD算法以解码这些格式。我们还将这些算法扩展到最常见的传统编码格式。我们的实验表明,基于SIMD的解码算法比非SIMD算法快3倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号