...
首页> 外文期刊>Signal processing >An improved method of locality sensitive hashing for indexing large-scale and high-dimensional features
【24h】

An improved method of locality sensitive hashing for indexing large-scale and high-dimensional features

机译:一种用于索引大规模和高维特征的改进的局部敏感哈希方法

获取原文
获取原文并翻译 | 示例
           

摘要

In recent years, Locality sensitive hashing (LSH) has been popularly used as an effective and efficient index structure of multimedia signals. LSH is originally proposed for resolving the high-dimensional approximate similarity search problem. Until now, many kinds of variations of LSH have been proposed for large-scale indexing. Much of the interest is focused on improving the query accuracy for skewed data distribution and reducing the storage space. However, when using LSH, a final filtering process based on exact similarity measure is needed. When the dataset is large-scale, the number of points to be filtered becomes large. As a result, the filtering speed becomes the bottleneck of improving the query speed when the scale of data becomes larger and larger. Furthermore, we observe a "Non-Uniform" phenomenon in the most popular Euclidean LSH which can degrade the filtering speed dramatically. In this paper, a pivot-based algorithm is proposed to improve the filtering speed by using triangle inequality to prune the search process. Furthermore, a novel method to select an optimal pivot for even larger improvement is provided. The experimental results on two open large-scale datasets show that our method can significantly improve the query speed of Euclidean LSH.
机译:近年来,局域敏感哈希(LSH)已被广泛用作多媒体信号的有效索引结构。 LSH最初是为解决高维近似相似性搜索问题而提出的。到目前为止,已经提出了许多LSH的变体用于大规模索引。许多兴趣集中在提高偏斜数据分布的查询准确性和减少存储空间上。但是,当使用LSH时,需要基于精确相似性度量的最终过滤过程。当数据集规模较大时,要过滤的点数将变大。结果,当数据规模越来越大时,过滤速度成为提高查询速度的瓶颈。此外,我们在最流行的欧几里得LSH中观察到“非均匀”现象,这会大大降低过滤速度。本文提出了一种基于枢轴的算法,通过利用三角形不等式来简化搜索过程,从而提高了过滤速度。此外,提供了一种用于选择最佳枢轴以进行更大改进的新颖方法。在两个开放的大规模数据集上的实验结果表明,我们的方法可以显着提高欧几里得LSH的查询速度。

著录项

  • 来源
    《Signal processing》 |2013年第8期|2244-2255|共12页
  • 作者单位

    Institute of Computing Technology, Chinese Academy of Sciences, No. 6 Kexueyuan South Road, Haidian District, 100190 Beijing, China,Graduate University of Chinese Academy of Sciences, 19A Yuquanlu, 100049 Beijing, China,Beijing Key Laboratory of Mobile Computing and Pervasive Device (Institute of Computing Technology, Chinese Academy of Sciences), No. 6 Kexueyuan South Road, Haidian District, 100190 Beijing, China;

    Institute of Computing Technology, Chinese Academy of Sciences, No. 6 Kexueyuan South Road, Haidian District, 100190 Beijing, China,Beijing Key Laboratory of Mobile Computing and Pervasive Device (Institute of Computing Technology, Chinese Academy of Sciences), No. 6 Kexueyuan South Road, Haidian District, 100190 Beijing, China;

    Institute of Computing Technology, Chinese Academy of Sciences, No. 6 Kexueyuan South Road, Haidian District, 100190 Beijing, China ,Graduate University of Chinese Academy of Sciences, 19A Yuquanlu, 100049 Beijing, China,Beijing Key Laboratory of Mobile Computing and Pervasive Device (Institute of Computing Technology, Chinese Academy of Sciences), No. 6 Kexueyuan South Road, Haidian District, 100190 Beijing, China;

    Institute of Computing Technology, Chinese Academy of Sciences, No. 6 Kexueyuan South Road, Haidian District, 100190 Beijing, China ,Beijing Key Laboratory of Mobile Computing and Pervasive Device (Institute of Computing Technology, Chinese Academy of Sciences), No. 6 Kexueyuan South Road, Haidian District, 100190 Beijing, China;

    Institute of Computing Technology, Chinese Academy of Sciences, No. 6 Kexueyuan South Road, Haidian District, 100190 Beijing, China ,Beijing Key Laboratory of Mobile Computing and Pervasive Device (Institute of Computing Technology, Chinese Academy of Sciences), No. 6 Kexueyuan South Road, Haidian District, 100190 Beijing, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Locality sensitive hashing; Large-scale; High-dimensional; "Non-Uniform" problem; Pivot-based filtering algorithm;

    机译:局部敏感哈希;大规模;高维;“非统一”问题;基于枢轴的过滤算法;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号