首页> 外文会议>Pacific Rim international conference on artificial intelligence >MIDCN: A Multiple Instance Deep Convolutional Network for Image Classification
【24h】

MIDCN: A Multiple Instance Deep Convolutional Network for Image Classification

机译:MIDCN:用于图像分类的多实例深卷积网络

获取原文

摘要

For the image classification task, usually, the image collected in the wild contains multiple objects instead of a single dominant one. Besides, the image label is not explicitly associated with the object region, i.e., it is weakly annotated. In this paper, we propose a novel deep convolutional network for image classification under a weakly supervised condition. The proposed method, namely MIDCN, formulate the problem into Multiple Instance Learning (MIL), where each image is a bag which contains multiple instances (objects). Different with previous deep MfL methods which predict the label of each bag (i.e., image) by simply performing pooling/voting strategy over their instance (i.e., region) predictions, MIDCN directly predicts the label of a bag via bag features learned by measuring the similarities between instance features and a set of learned informative prototypes. Specifically, the prototypes are obtained by a newly proposed Global Contrast Pooling (GCP) layer which leverages instances not only coming from the current bag but also the other bags. Thus the learned bag features also contain global information of all the training bags, which is more robust and noise free. We did extensive experiments on two real-world image datasets, including both natural image dataset (PASCAL VOC 07) and pathological lung cancer image dataset, and show the results of the proposed MIDCN consistently outperforms the state-of-the-art methods.
机译:对于图像分类任务,通常,野外收集的图像包含多个对象而不是单个主导的对象。此外,图像标签没有明确地与对象区域相关联,即,它是弱注释的。在本文中,我们提出了一种在弱监督条件下进行图像分类的新型深度卷积网络。所提出的方法,即MIDCN,将问题分为多实例学习(MIL),其中每个图像是包含多个实例(对象)的袋子。与先前的深MFL方法不同,通过简单地在其实例(即,区域)预测中,MIDCN通过测量来通过袋子特征直接预测袋子的标签来预测每个袋子(即,图像)的标签。通过测量,MIDCN直接预测通过袋子特征来预测袋子的标签实例特征和一组学习的信息原型之间的相似之序。具体地,原型是通过新的全局对比汇集(GCP)层获得的,该层不仅利用不仅来自当前袋子而且还具有其他袋子的情况。因此,学习的袋子功能还包含所有训练袋的全球信息,这是更强大的稳健和无噪声。我们在两个现实世界图像数据集中进行了广泛的实验,包括天然图像数据集(Pascal VOC 07)和病理肺癌图像数据集,并显示所提出的MIDCN的结果始终如一地优于最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号