首页> 外文会议>IEEE International Conference of Intelligent Robotic and Control Engineering >Image Retrieval Based on a Hybrid Model of Deep Convolutional Encoder
【24h】

Image Retrieval Based on a Hybrid Model of Deep Convolutional Encoder

机译:基于深卷积编码器混合模型的图像检索

获取原文

摘要

Aiming at the difficulty of semantic gap in content-based image search (CBIR), inspired by the convolutional neural network (CNN) in image classification and detection, this paper proposes a simple and effective hybrid model of deep convolutional network and autoencoder network. This model uses the CNN network to extract the high-level semantic features of the image, then uses the depth autoencoder network to reduce the dimension of the extracted image features, and compresses the features into a 128-bit vector representation. Nearest Neighbor Search (ANN) is an effective strategy for large-scale image retrieval. This paper uses the annoy algorithm to calculate the similarity between the query image and the index tree, and outputs them in descending order of similarity.Experimental results show that the proposed method outperforms some of the latest deep-network image retrieval algorithms on the CIFAR-10 and MNIST datasets. In the TOP10 image search, the MNSIT dataset can obtain 100% accuracy. In the CIFAR dataset experiment, the accuracy and recall rate of the CIFAR4 dataset are as high as 99.9%, and the accuracy and recall rate of the CIFAR10 dataset reach respectively 97.2% and 98.1%. In addition, the size of the convolutional network's parameters and the size of the index are optimized compared to the previous model, so that the effect of second-level real-time response can be achieved in the 10,000-level image search.
机译:针对基于内容的图像搜索(CBIR)的语义间隙的难度,由图像分类和检测中的卷积神经网络(CNN)的启发,提出了一种简单有效的深卷积网络和自动化网络的混合模型。该模型使用CNN网络提取图像的高电平语义特征,然后使用深度自动统计器网络来减少提取的图像特征的维度,并将其特征压缩到128位矢量表示中。最近的邻居搜索(ANN)是大规模图像检索的有效策略。本文使用令人作呕的算法计算查询图像和索引树之间的相似性,并以相似性降序输出它们。实验结果表明,所提出的方法优于CIFAR上的一些最新的深网络图像检索算法 - 10和Mnist数据集。在Top10图像搜索中,MNSIT数据集可以获得100%的精度。在CIFAR数据集实验中,CIFAR4数据集的精度和召回率高达99.9%,CIFAR10数据集的准确性和召回率分别达到97.2%和98.1%。另外,与先前的模型相比,卷积网络参数的大小和索引的大小进行了优化,从而可以在10,000级图像搜索中实现二级实时响应的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号