...
首页> 外文期刊>Neurocomputing >Aggregating hierarchical binary activations for image retrieval
【24h】

Aggregating hierarchical binary activations for image retrieval

机译:聚合分层二进制激活进行图像检索

获取原文
获取原文并翻译 | 示例
           

摘要

Convolutional Neural Networks (CNNs) have achieved a breakthrough on a large number of image retrieval benchmarks. However, most previous works make use of the CNNs following the image classification strategy, where the last fully connected layer activations of the whole image are occupied as a single holistic feature vector. To improve the representation power of CNNs, this paper proposes a Multilayer Fusion (MF) approach to aggregate deep activations for image retrieval task. The key insight of our approach is that different layers of a CNN are sensitive to specific patterns, and are complementary with each other for image representation. Specifically, our approach transforms CNN activations to deep binary codes embedded in the inverted index of Bag-of-Words structure for fast retrieval. Those activations are derived from multiple layers of a CNN on local patches, for features from orderless local areas have proved superior to global ones in the low level handcrafted cases. Corresponding weights and diffusion process are thereafter utilized to penalize and re-rank the individual similarity scores of layers. Our method is efficient, which extracts visual features from different layers only once. Furthermore, the proposed MF approach can be easily extended to include SIFT features to enhance the representation power. Extensive experiments on four public retrieval datasets quantitatively evaluate the effectiveness of our contributions, and the proposed algorithm prove to be the new state-of-the-art on the Holidays and UKBench datasets. (C) 2018 Elsevier B.V. All rights reserved.
机译:卷积神经网络(CNN)在大量图像检索基准方面取得了突破。但是,大多数先前的工作都遵循图像分类策略来使用CNN,在该策略中,整个图像的最后完全连接的层激活被作为单个整体特征矢量占据。为了提高CNN的表示能力,本文提出了一种多层融合(MF)方法来聚合用于图像检索任务的深度激活。我们方法的关键见解是CNN的不同层对特定模式敏感,并且在图像表示方面彼此互补。具体来说,我们的方法将CNN激活转换为嵌入在Word of Words结构的反向索引中的深层二进制代码,以便快速检索。这些激活来自本地补丁上CNN的多层,因为在低水平手工制作的情况下,无序局部区域的功能已被证明优于全局区域。此后,利用相应的权重和扩散过程来惩罚各个层的相似度得分并对其重新排序。我们的方法是有效的,它仅从一次提取不同图层的视觉特征。此外,所提出的MF方法可以容易地扩展为包括SIFT特征以增强表示能力。在四个公共检索数据集上进行的大量实验定量评估了我们的贡献的有效性,并且所提出的算法被证明是Holidays和UKBench数据集上的最新技术。 (C)2018 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号