首页> 外文会议>IEEE International Conference of Intelligent Robotic and Control Engineering >Image Retrieval Based on a Hybrid Model of Deep Convolutional Encoder
【24h】

Image Retrieval Based on a Hybrid Model of Deep Convolutional Encoder

机译:基于深度卷积编码器混合模型的图像检索

获取原文

摘要

Aiming at the difficulty of semantic gap in content-based image search (CBIR), inspired by the convolutional neural network (CNN) in image classification and detection, this paper proposes a simple and effective hybrid model of deep convolutional network and autoencoder network. This model uses the CNN network to extract the high-level semantic features of the image, then uses the depth autoencoder network to reduce the dimension of the extracted image features, and compresses the features into a 128-bit vector representation. Nearest Neighbor Search (ANN) is an effective strategy for large-scale image retrieval. This paper uses the annoy algorithm to calculate the similarity between the query image and the index tree, and outputs them in descending order of similarity.Experimental results show that the proposed method outperforms some of the latest deep-network image retrieval algorithms on the CIFAR-10 and MNIST datasets. In the TOP10 image search, the MNSIT dataset can obtain 100% accuracy. In the CIFAR dataset experiment, the accuracy and recall rate of the CIFAR4 dataset are as high as 99.9%, and the accuracy and recall rate of the CIFAR10 dataset reach respectively 97.2% and 98.1%. In addition, the size of the convolutional network's parameters and the size of the index are optimized compared to the previous model, so that the effect of second-level real-time response can be achieved in the 10,000-level image search.
机译:针对卷积神经网络(CNN)在图像分类和检测中的应用,基于内容的图像搜索(CBIR)存在语义鸿沟的困难,提出了一种简单有效的深度卷积网络与自动编码器混合模型。该模型使用CNN网络提取图像的高级语义特征,然后使用深度自动编码器网络减小提取的图像特征的维数,并将特征压缩为128位矢量表示。最近邻搜索(ANN)是一种用于大规模图像检索的有效策略。本文使用烦人的算法来计算查询图像和索引树之间的相似度,并以相似度从高到低的顺序输出。实验结果表明,该方法在CIFAR-上优于某些最新的深层网络图像检索算法。 10和MNIST数据集。在TOP10图像搜索中,MNSIT数据集可以获得100%的准确性。在CIFAR数据集实验中,CIFAR4数据集的准确性和召回率高达99.9%,CIFAR10数据集的准确性和召回率分别达到97.2%和98.1%。另外,与以前的模型相比,卷积网络参数的大小和索引的大小都得到了优化,因此可以在10,000级图像搜索中实现二级实时响应的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号