首页> 外文期刊>International Journal of Computer Vision >Structured Learning of Binary Codes with Column Generation for Optimizing Ranking Measures
【24h】

Structured Learning of Binary Codes with Column Generation for Optimizing Ranking Measures

机译:具有列生成的二元码的结构化学习,用于优化排名措施

获取原文
获取原文并翻译 | 示例
           

摘要

Hashing methods aim to learn a set of hash functions which map the original features to compact binary codes with similarity preserving in the Hamming space. Hashing has proven a valuable tool for large-scale information retrieval. We propose a column generation based binary code learning framework for data-dependent hash function learning. Given a set of triplets that encode the pairwise similarity comparison information, our column generation based method learns hash functions that preserve the relative comparison relations within the large-margin learning framework. Our method iteratively learns the best hash functions during the column generation procedure. Existing hashing methods optimize over simple objectives such as the reconstruction error or graph Laplacian related loss functions, instead of the performance evaluation criteria of interest-multivariate performance measures such as the AUC and NDCG. Our column generation based method can be further generalized from the triplet loss to a general structured learning based framework that allows one to directly optimize multivariate performance measures. For optimizing general ranking measures, the resulting optimization problem can involve exponentially or infinitely many variables and constraints, which is more challenging than standard structured output learning. We use a combination of column generation and cutting-plane techniques to solve the optimization problem. To speed-up the training we further explore stage-wise training and propose to optimize a simplified NDCG loss for efficient inference. We demonstrate the generality of our method by applying it to ranking prediction and image retrieval, and show that it outperforms several state-of-the-art hashing methods.
机译:哈希方法旨在学习一组散列函数,将原始功能映射到紧凑的二进制代码,其具有保留在汉明空间中的相似性。哈希证明了一个有价值的工具,用于大规模信息检索。我们提出了一种基于列生成的二进制代码学习框架,用于数据依赖哈希函数学习。给定一组三胞胎,它们编码了一组相似性比较信息,我们的列生成的方法学习散列函数,以保留大边缘学习框架内的相对比较关系。我们的方法迭代地在列生成过程中学习最佳哈希函数。现有的散列方法优化了诸如重建误差或图拉普拉斯相关损失函数之类的简单目标,而不是兴趣 - 多变量性能措施(如AUC和NDCG)的性能评估标准。我们的列生成的方法可以从三联体丢失进一步推广到基于一般的结构化学习的框架,允许人们直接优化多变量性能措施。为了优化一般排名措施,所得到的优化问题可以呈指数级或无限的许多变量和约束,这比标准结构化输出学习更具挑战性。我们使用列生成和剥离机技术的组合来解决优化问题。加快培训我们进一步探索舞台明智的培训,并建议优化简化的NDCG损耗以获得有效推理。我们通过将其应用于预测和图像检索来展示我们方法的一般性,并表明它优于几种最先进的散列方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号