首页> 外文期刊>Pattern recognition letters >Locality sensitive hashing: A comparison of hash function types and querying mechanisms
【24h】

Locality sensitive hashing: A comparison of hash function types and querying mechanisms

机译:局部敏感哈希:哈希函数类型和查询机制的比较

获取原文
获取原文并翻译 | 示例
           

摘要

It is well known that high-dimensional nearest neighbor retrieval is very expensive. Dramatic performance gains are obtained using approximate search schemes, such as the popular Locality-Sensitive Hashing (LSH). Several extensions have been proposed to address the limitations of this algorithm, in particular, by choosing more appropriate hash functions to better partition the vector space. All the proposed extensions, however, rely on a structured quantizer for hashing, poorly fitting real data sets, limiting its performance in practice. In this paper, we compare several families of space hashing functions in a real setup, namely when searching for high-dimension SIFT descriptors. The comparison of random projections, lattice quantizers,k-means and hierarchical k-means reveal that unstructured quantizer significantly improves the accuracy of LSH, as it closely fits the data in the feature space. We then compare two querying mechanisms introduced in the literature with the one originally proposed in LSH, and discuss their respective merits and limitations.
机译:众所周知,高维最近邻检索非常昂贵。使用近似的搜索方案(例如流行的“局部敏感哈希”(LSH))可以获得显着的性能提升。已经提出了几种扩展来解决该算法的局限性,特别是通过选择更合适的哈希函数来更好地划分向量空间。但是,所有建议的扩展都依赖于结构化的量化器进行散列,从而无法拟合真实数据集,从而限制了其在实践中的性能。在本文中,我们在实际设置中(即搜索高维SIFT描述符时)比较了几类空间哈希函数。通过比较随机投影,点阵量化器,k均值和分层k均值,非结构化量化器可以将LSH精确地拟合到特征空间中,从而显着提高LSH的准确性。然后,我们将文献中介绍的两种查询机制与LSH中最初提出的两种查询机制进行比较,并讨论它们各自的优缺点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号