首页> 外文会议>International Conference on Pattern Recognition Applications and Methods >CRVM: Circular Random Variable-based Matcher - A Novel Hashing Method for Fast NN Search in High-dimensional Spaces
【24h】

CRVM: Circular Random Variable-based Matcher - A Novel Hashing Method for Fast NN Search in High-dimensional Spaces

机译:CRVM:圆形随机可变基于可变的匹配 - 一种新型散列方法,用于快速NN搜索在高维空间中

获取原文

摘要

Nearest Neighbour (NN) search is an essential and important problem in many areas, including multimedia databases, data mining and computer vision. For low-dimensional spaces a variety of tree-based NN search algorithms efficiently cope with finding the NN, for high-dimensional spaces, however, these methods are in-efficient. Even for Locality Sensitive Hashing (LSH) methods which solve the task approximately by grouping sample points that are nearby in the search space into buckets, it is difficult to find the right parameters. In this paper, we propose a novel hashing method that ensures a high probability of NNs being located in the same hash buckets and a balanced distribution of data across all the buckets. The proposed method is based on computing a selected number of pairwise uncorrelated and uniformly-distributed Circular Random Variables (CRVs) from the sample points. The method has been tested on a large dataset of SIFT features and was compared to LSH and the Fast Library for Approximated NN search (FLANN) matcher with linear search as the base line. The experimental results show that our method significantly reduces the search query time while preserving the search quality, in particular for dynamic databases and small databases whose size does not exceed 200k points.
机译:最近的邻居(NN)搜索是许多领域的重要和重要问题,包括多媒体数据库,数据挖掘和计算机视觉。对于低维空间的基于树的基于树的NN搜索算法有效地应对NN,对于高维空间,这些方法有效。甚至对于派对散列(LSH)方法,该方法甚至通过在搜索空间中附近的样本点对铲斗进行分组,甚至可以通过将附近的样本点进行分组,很难找到正确的参数。在本文中,我们提出了一种新颖的散列方法,该方法可确保NNS位于相同哈希桶中的高概率和跨越所有铲斗的数据的平衡分布。所提出的方法基于从采样点计算所选择的成对不相关和均匀分布的圆形随机变量(CRV)。该方法已经在SIFT特征的大型数据集上进行了测试,并与LSH和快速库进行比较,用于近似NN搜索(FLANN)匹配器作为基线。实验结果表明,我们的方法在保留搜索质量的同时显着降低了搜索查询时间,特别是对于大小不超过200k点的动态数据库和小型数据库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号