首页> 外文期刊>Neurocomputing >Content-based image retrieval with compact deep convolutional features
【24h】

Content-based image retrieval with compact deep convolutional features

机译:具有紧凑的深度卷积特征的基于内容的图像检索

获取原文
获取原文并翻译 | 示例
           

摘要

Convolutional neural networks (CNNs) with deep learning have recently achieved a remarkable success with a superior performance in computer vision applications. Most of CNN-based methods extract image features at the last layer using a single CNN architecture with orderless quantization approaches, which limits the utilization of intermediate convolutional layers for identifying image local patterns. As one of the first works in the context of content-based image retrieval (CBIR), this paper proposes a new bilinear CNN-based architecture using two parallel CNNs as feature extractors. The activations of convolutional layers are directly used to extract the image features at various image locations and scales. The network architecture is initialized by deep CNNs sufficiently pre-trained on a large generic image dataset then fine-tuned for the CBIR task. Additionally, an efficient bilinear root pooling is proposed and applied to the low-dimensional pooling layer to reduce the dimension of image features to compact but high discriminative image descriptors. Finally, an end-to-end training with backpropagation is performed to fine-tune the final architecture and to learn its parameters for the image retrieval task. The experimental results achieved on three standard benchmarking image datasets demonstrate the outstanding performance of the proposed architecture at extracting and learning complex features for the CBIR task without prior knowledge about the semantic meta-data of images. For instance, using a very compact image vector of 16-length, we achieve a retrieval accuracy of 95.7% (mAP) on Oxford 5K and 88.6% on Oxford 105K; which outperforms the best results reported by state-of-the-art approaches. Additionally, a noticeable reduction is attained in the required extraction time for image features and the memory size required for storage. 2017 Elsevier B.V. All rights reserved.
机译:具有深度学习功能的卷积神经网络(CNN)最近在计算机视觉应用中取得了卓越的性能,并取得了令人瞩目的成功。大多数基于CNN的方法使用具有无序量化方法的单个CNN架构在最后一层提取图像特征,这限制了中间卷积层用于识别图像局部图案的利用。作为基于内容的图像检索(CBIR)上下文中的第一批工作之一,本文提出了一种使用两个并行CNN作为特征提取器的新的基于双线性CNN的体系结构。卷积层的激活直接用于提取各种图像位置和比例下的图像特征。网络架构由在大型通用图像数据集上进行充分预训练的深层CNN初始化,然后针对CBIR任务进行微调。此外,提出了一种有效的双线性根池并将其应用于低维池层,以将图像特征的维数减小为紧凑但具有高判别力的图像描述符。最后,进行反向传播的端到端训练,以微调最终体系结构并学习其用于图像检索任务的参数。在三个标准基准图像数据集上获得的实验结果证明了该结构在提取和学习CBIR任务的复杂特征方面的出色性能,而无需事先了解图像的语义元数据。例如,使用非常紧凑的16长度图像矢量,我们在Oxford 5K上的检索精度为95.7%(mAP),在Oxford 105K上的检索精度为88.6%;胜过最新方法报告的最佳结果。另外,图像特征所需的提取时间和存储所需的存储器大小显着减少。 2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号