首页> 外文期刊>Advances in multimedia >Combining Convolutional Neural Network and Markov Random Field for Semantic Image Retrieval
【24h】

Combining Convolutional Neural Network and Markov Random Field for Semantic Image Retrieval

机译:结合卷积神经网络和马尔可夫随机场进行语义图像检索

获取原文
           

摘要

With the rapidly growing number of images over the Internet, efficient scalable semantic image retrieval becomes increasingly important. This paper presents a novel approach for semantic image retrieval by combining Convolutional Neural Network (CNN) and Markov Random Field (MRF). As a key step, image concept detection, that is, automatically recognizing multiple semantic concepts in an unlabeled image, plays an important role in semantic image retrieval. Unlike previous work that uses single-concept classifiers one by one, we detect semantic multiconcept by using a multiconcept scene classifier. In other words, our approach takes multiple concepts as a holistic scene for multiconcept scene learning. Specifically, we first train a CNN as a concept classifier, which further includes two types of classifiers a single-concept fully connected classifier that is best suited to single-concept detection and a multiconcept scene fully connected classifier that is good for holistic scene detection. Then we propose an MRF-based late fusion approach that is able to effectively learn the semantic correlation between the single-concept classifier and multiconcept scene classifier. Finally, the semantic correlation among the subconcepts of images is cought to further improve detection precision. In order to investigate the feasibility and effectiveness of our proposed approach, we conduct comprehensive experiments on two publicly available image databases. The results show that our proposed approach outperforms several state-of-the-art approaches.
机译:随着Internet上图像数量的迅速增长,有效的可伸缩语义图像检索变得越来越重要。本文结合卷积神经网络(CNN)和马尔可夫随机场(MRF),提出了一种新颖的语义图像检索方法。作为关键步骤,图像概念检测(即自动识别未标记图像中的多个语义概念)在语义图像检索中起着重要作用。与以前的使用一个概念一个分类器的工作不同,我们通过使用一个多概念场景分类器来检测语义多概念。换句话说,我们的方法将多个概念作为用于多概念场景学习的整体场景。具体来说,我们首先训练CNN作为概念分类器,它进一步包括两种类型的分类器:最适合单概念检测的单概念完全连接分类器和最适合整体场景检测的多概念场景完全连接分类器。然后,我们提出了一种基于MRF的后期融合方法,该方法能够有效地学习单概念分类器和多概念场景分类器之间的语义相关性。最后,寻求图像子概念之间的语义相关性,以进一步提高检测精度。为了调查我们提出的方法的可行性和有效性,我们对两个公开可用的图像数据库进行了全面的实验。结果表明,我们提出的方法优于几种最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号