首页> 外文会议>SIAM International Conference on Data Mining >Adaptive Quantization for Hashing: An Information-Based Approach to Learning Binary Codes
【24h】

Adaptive Quantization for Hashing: An Information-Based Approach to Learning Binary Codes

机译:哈希自适应量化:基于信息的学习二进制代码方法

获取原文

摘要

Large-scale data mining and retrieval applications have increasingly turned to compact binary data representations as a way to achieve both fast queries and efficient data storage; many algorithms have been proposed for learning effective binary encodings. Most of these algorithms focus on learning a set of projection hyperplanes for the data and simply binarizing the result from each hyperplane, but this neglects the fact that informative-ness may not be uniformly distributed across the projections. In this paper, we address this issue by proposing a novel adaptive quantization (AQ) strategy that adaptively assigns varying numbers of bits to different hyperplanes based on their information content. Our method provides an information-based schema that preserves the neighborhood structure of data points, and we jointly find the globally optimal bit-allocation for all hyperplanes. In our experiments, we compare with state-of-the-art methods on four large-scale datasets and find that our adaptive quantization approach significantly improves on traditional hashing methods.
机译:大规模数据挖掘和检索应用程序越来越转向紧凑的二进制数据表示,作为实现快速查询和有效数据存储的方式;已经提出了许多算法用于学习有效的二进制编码。这些算法中的大多数都侧重于学习数据的一组投影超平面,并简单地二进制化每个超平面的结果,但这忽略了信息 - NESS可能不会均匀分布在预测上的事实。在本文中,我们通过提出新的自适应量化(AQ)策略来解决这个问题,该策略基于其信息内容自适应地将不同数量的比特分配给不同的超平面。我们的方法提供了一种基于信息的模式,可以保留数据点的邻域结构,并且我们共同查找所有超平面的全局最佳比特分配。在我们的实验中,我们在四个大规模数据集中与最先进的方法进行比较,并发现我们的自适应量化方法显着提高了传统的散列方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号