CRVM: Circular Random Variable-based Matcher - A Novel Hashing Method for Fast NN Search in High-dimensional Spaces



Abstract

Nearest Neighbour (NN) search is an essential problem in many areas, including multimedia databases, data mining and computer vision. For low-dimensional spaces, a variety of tree-based NN search algorithms find the NN efficiently; for high-dimensional spaces, however, these methods are inefficient. Locality Sensitive Hashing (LSH) methods solve the task approximately by grouping sample points that are nearby in the search space into buckets, but even for these methods it is difficult to find the right parameters. In this paper, we propose a novel hashing method that ensures a high probability of NNs being located in the same hash bucket and a balanced distribution of data across all the buckets. The proposed method is based on computing a selected number of pairwise uncorrelated and uniformly distributed Circular Random Variables (CRVs) from the sample points. The method was tested on a large dataset of SIFT features and compared to LSH and the Fast Library for Approximate Nearest Neighbors (FLANN) matcher, with linear search as the baseline. The experimental results show that our method significantly reduces the search query time while preserving the search quality, in particular for dynamic databases and small databases whose size does not exceed 200k points.
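To make the bucket idea in the abstract concrete, the following is a minimal sketch of a generic random-hyperplane LSH index next to a linear-search baseline. It is not the CRVM method, whose construction of the CRVs is not given in this record. The function names, the `n_bits` parameter and the synthetic 128-dimensional Gaussian data (128 being the usual SIFT descriptor length) are all assumptions made for illustration.

```python
import numpy as np
from collections import defaultdict


def hash_keys(points, planes):
    """Map each row of `points` to an integer bucket key (one bit per hyperplane)."""
    bits = (points @ planes.T) > 0            # side of each random hyperplane
    weights = 1 << np.arange(planes.shape[0])  # bit weights 1, 2, 4, ...
    return bits.astype(np.int64) @ weights


def build_lsh_index(points, n_bits=8, seed=0):
    """Draw random hyperplanes (hypothetical setup) and group point indices by key."""
    rng = np.random.default_rng(seed)
    planes = rng.standard_normal((n_bits, points.shape[1]))
    buckets = defaultdict(list)
    for idx, key in enumerate(hash_keys(points, planes)):
        buckets[int(key)].append(idx)
    return planes, buckets


def linear_nn(points, query):
    """Exact baseline: compare the query against every stored point."""
    return int(np.argmin(np.linalg.norm(points - query, axis=1)))


def lsh_nn(points, planes, buckets, query):
    """Approximate NN: search only the bucket the query hashes to."""
    key = int(hash_keys(query[None, :], planes)[0])
    candidates = buckets.get(key, [])
    if not candidates:
        return None  # empty bucket: no candidate found
    cand = np.asarray(candidates)
    dists = np.linalg.norm(points[cand] - query, axis=1)
    return int(cand[np.argmin(dists)])


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    pts = rng.standard_normal((500, 128))  # synthetic stand-in for descriptors
    planes, buckets = build_lsh_index(pts, n_bits=8)
    query = pts[17] + 0.01 * rng.standard_normal(128)
    print("linear:", linear_nn(pts, query), "lsh:", lsh_nn(pts, planes, buckets, query))
```

In this sketch, `n_bits` is the kind of parameter the abstract describes as hard to choose: more bits give smaller buckets and faster queries but raise the chance that a true neighbour falls into a different bucket, and nothing in the scheme forces the buckets to be evenly filled.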


Bibliographic details

Source: International Conference on Pattern Recognition Applications and Methods | 2018 | 1 (CD-ROM) | 8 pages
Authors: Faraj Alhwarin; Alexander Ferrein; Ingrid Scholl
Original format: PDF
Chinese Library Classification: TP391.41-53
Keywords: Feature Matching; Hash Tree; Fast NN Search


