首页> 外文期刊>IEEE Transactions on Image Processing >Hash Bit Selection for Nearest Neighbor Search
【24h】

Hash Bit Selection for Nearest Neighbor Search

机译:用于最近邻居搜索的哈希位选择

获取原文
获取原文并翻译 | 示例
       

摘要

To overcome the barrier of storage and computation when dealing with gigantic-scale data sets, compact hashing has been studied extensively to approximate the nearest neighbor search. Despite the recent advances, critical design issues remain open in how to select the right features, hashing algorithms, and/or parameter settings. In this paper, we address these by posing an optimal hash bit selection problem, in which an optimal subset of hash bits are selected from a pool of candidate bits generated by different features, algorithms, or parameters. Inspired by the optimization criteria used in existing hashing algorithms, we adopt the bit reliability and their complementarity as the selection criteria that can be carefully tailored for hashing performance in different tasks. Then, the bit selection solution is discovered by finding the best tradeoff between search accuracy and time using a modified dynamic programming method. To further reduce the computational complexity, we employ the pairwise relationship among hash bits to approximate the high-order independence property, and formulate it as an efficient quadratic programming method that is theoretically equivalent to the normalized dominant set problem in a vertex- and edge-weighted graph. Extensive large-scale experiments have been conducted under several important application scenarios of hash techniques, where our bit selection framework can achieve superior performance over both the naive selection methods and the state-of-the-art hashing algorithms, with significant accuracy gains ranging from 10% to 50%, relatively.
机译:为了克服在处理大规模数据集时的存储和计算障碍,已经广泛研究了紧凑散列以近似最近的邻居搜索。尽管有最近的进展,但是如何选择正确的功能,哈希算法和/或参数设置仍然存在关键的设计问题。在本文中,我们通过提出一个最佳的哈希位选择问题来解决这些问题,其中从不同特征,算法或参数生成的候选位池中选择哈希位的最佳子集。受现有哈希算法中使用的优化标准的启发,我们采用比特可靠性及其互补性作为选择标准,可以针对不同任务中的哈希性能精心定制这些选择标准。然后,通过使用改进的动态编程方法在搜索精度和时间之间找到最佳折衷,来发现位选择解决方案。为了进一步降低计算复杂度,我们使用哈希位之间的成对关系来逼近高阶独立性,并将其公式化为一种有效的二次编程方法,在理论上等效于顶点和边沿中的归一化主导集问题。加权图。在散列技术的几个重要应用场景下,已经进行了广泛的大规模实验,其中我们的位选择框架可以比朴素的选择方法和最新的散列算法都实现卓越的性能,并且精度显着提高,从相对来说是10%到50%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号