首页> 外文会议>Annual conference on Neural Information Processing Systems >Angular Quantization-based Binary Codes for Fast Similarity Search
【24h】

Angular Quantization-based Binary Codes for Fast Similarity Search

机译:基于角度量化的二进制代码,用于快速相似性搜索

获取原文

摘要

This paper focuses on the problem of learning binary codes for efficient retrieval of high-dimensional non-negative data that arises in vision and text applications where counts or frequencies are used as features. The similarity of such feature vectors is commonly measured using the cosine of the angle between them. In this work, we introduce a novel angular quantization-based binary coding (AQBC) technique for such data and analyze its properties. In its most basic form, AQBC works by mapping each non-negative feature vector onto the vertex of the binary hypercube with which it has the smallest angle. Even though the number of vertices (quantization landmarks) in this scheme grows exponentially with data dimensionality d, we propose a method for mapping feature vectors to their smallest-angle binary vertices that scales as O(d log d). Further, we propose a method for learning a linear transformation of the data to minimize the quantization error, and show that it results in improved binary codes. Experiments on image and text datasets show that the proposed AQBC method outperforms the state of the art.
机译:本文着重于学习二进制代码的问题,以有效地检索以计数或频率为特征的视觉和文本应用程序中出现的高维非负数据。通常使用它们之间的角度的余弦来测量此类特征向量的相似性。在这项工作中,我们为此类数据引入了一种新颖的基于角度量化的二进制编码(AQBC)技术,并分析了其特性。在最基本的形式中,AQBC的工作原理是将每个非负特征向量映射到具有最小角度的二进制超立方体的顶点上。即使此方案中的顶点(量化界标)的数量随数据维数d呈指数增长,我们仍提出了一种将特征向量映射到其最小角度二进制顶点的方法,该最小尺度二进制顶点的缩放比例为O(d log d)。此外,我们提出了一种用于学习数据的线性变换以最小化量化误差的方法,并表明它可以改善二进制代码。在图像和文本数据集上的实验表明,所提出的AQBC方法优于最新技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号