首页> 外文会议>International Joint Conference on Artificial Intelligence >Ranking Preserving Hashing for Fast Similarity Search
【24h】

Ranking Preserving Hashing for Fast Similarity Search

机译:为快速相似性搜索排名保留散列

获取原文

摘要

Hashing method becomes popular for large scale similarity search due to its storage and computational efficiency. Many machine learning techniques, ranging from unsupervised to supervised, have been proposed to design compact hashing codes. Most of the existing hashing methods generate binary codes to efficiently find similar data examples to a query. However, the ranking accuracy among the retrieved data examples is not modeled. But in many real world applications, ranking measure is important for evaluating the quality of hashing codes. In this paper, we propose a novel Ranking Preserving Hashing (RPH) approach that directly optimizes a popular ranking measure, Normalized Discounted Cumulative Gain (NDCG), to obtain effective hashing codes with high ranking accuracy. The main difficulty in the direct optimization of NDCG measure is that it depends on the ranking order of data examples, which forms a non-convex non-smooth optimization problem. We address this challenge by optimizing the expectation of NDCG measure calculated based on a linear hashing function. A gradient descent method is designed to achieve the goal. An extensive set of experiments on two large scale datasets demonstrate the superior ranking performance of the proposed approach over several state-of-the-art hashing methods.
机译:由于其存储和计算效率,散列方法对于大规模相似性搜索变得流行。已经提出了许多机器学习技术,从无监督监督,设计了紧凑的散列代码。大多数现有散列方法生成二进制代码,以便有效地查找类似的数据示例。但是,未建模检索到的数据示例之间的排名精度。但在许多现实世界应用中,排名措施对于评估散列代码的质量很重要。在本文中,我们提出了一种新的排名保存散列(RPH)方法,即直接优化流行的排名测量,归一化折扣累积增益(NDCG),以获得具有高排名准确度的有效散列代码。 NDCG措施直接优化的主要困难是它取决于数据示例的排名顺序,这形成了非凸非平滑优化问题。我们通过优化基于线性散列函数计算的NDCG度量的期望来解决这一挑战。梯度下降方法旨在实现目标。在两个大型数据集上进行了广泛的实验,展示了在几种最先进的散列方法上提出了所提出的方法的卓越等级性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号