首页> 外文会议>European Conference on Artificial Intelligence >Adaptive Binary Quantization for Fast Nearest Neighbor Search
【24h】

Adaptive Binary Quantization for Fast Nearest Neighbor Search

机译:快速最近邻搜索的自适应二进制量化

获取原文

摘要

Hashing has been proved an attractive technique for fast nearest neighbor search over big data. Compared to the projection based hashing methods, prototype based ones own stronger capability of generating discriminative binary codes for the data with complex inherent structure. However, our observation indicates that they still suffer from the insufficient coding that usually utilizes the complete binary codes in a hypercube. To address this problem, we propose an adaptive binary quantization method that learns a discriminative hash function with prototypes correspondingly associated with small unique binary codes. Our alternating optimization adaptively discovers the prototype set and the code set of a varying size in an efficient way, which together robustly approximate the data relations. Our method can be naturally generalized to the product space for long hash codes. We believe that our idea serves as a very helpful insight to hashing research. The extensive experiments on four large-scale (up to 80 million) datasets demonstrate that our method significantly outperforms state-of-the-art hashing methods, with up to 58.84% performance gains relatively.
机译:哈希已被证明是一个有吸引力的快速最近邻的技术搜索大数据。与基于投影的散列方法相比,基于原型的基于特性为具有复杂固有结构的数据生成识别二元码的能力。然而,我们的观察表明它们仍然遭受不足的编码,这些编码通常在HyperCube中使用完整的二进制代码。为了解决这个问题,我们提出了一种自适应二进制量化方法,其学习具有与小型独特二进制代码相应相关的原型的鉴别性哈希函数。我们的交替优化以有效的方式自适应地发现了不同大小的原型集和代码集,该尺寸​​在一起鲁棒地近似于数据关系。我们的方法可以自然地广泛地推广到用于长哈希码的产品空间。我们相信我们的想法是对哈希研究的非常有用的见解。四种大规模(高达8000万)数据集的广泛实验表明,我们的方法显着优于最先进的散列方法,相对高达58.84%的性能增益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号