首页> 外文会议>Pacific-Asia Conference on Knowledge Discovery and Data Mining >Locally Optimized Hashing for Nearest Neighbor Search
【24h】

Locally Optimized Hashing for Nearest Neighbor Search

机译:用于最近邻搜索的本地优化散列

获取原文

摘要

Fast nearest neighbor search (NNS) is becoming important to utilize massive data. Recent work shows that hash learning is effective for NNS in terms of computational time and space. Existing hash learning methods try to convert neighboring samples to similar binary codes, and their hash functions are globally optimized on the data manifold. However, such hash functions often have low resolution of binary codes; each bucket, a set of samples with same binary code, may contain a large number of samples in these methods, which makes it infeasible to obtain the nearest neighbors of given query with high precision. As a result, existing methods require long binary codes for precise NNS. In this paper, we propose Locally Optimized Hashing to overcome this drawback, which explicitly partitions each bucket by solving optimization problem based on that of Spectral Hashing with stronger constraints. Our method outperforms existing methods in image and document datasets in terms of quality of both the hash table and query, especially when the code length is short.
机译:快速最近的邻南搜索(NNS)正在变得越来越重要,可以利用大规模数据。最近的工作表明,在计算时间和空间方面,哈希学习对NNS有效。现有的哈希学习方法尝试将邻近的样本转换为类似的二进制代码,并且它们的散列函数在数据歧管上全局优化。但是,这种散列函数通常具有低分辨率二进制代码;每个桶,具有相同二进制代码的一组样本,可以包含这些方法中的大量样本,这使得能够高精度地获得给定查询的最近邻居不可行。因此,现有方法需要长二进制代码以精确NNS。在本文中,我们提出了局部优化的散列以克服该缺点,通过基于具有更强的限制的光谱散列来解决优化问题,明确地分区每个桶。我们的方法在哈希表和查询的质量方面优于图像和文档数据集中的现有方法,尤其是当代码长度短时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号