首页> 外文会议>Pacific Rim international conference on artificial intelligence >MIDCN: A Multiple Instance Deep Convolutional Network for Image Classification
【24h】

MIDCN: A Multiple Instance Deep Convolutional Network for Image Classification

机译:MIDCN:用于图像分类的多实例深度卷积网络

获取原文

摘要

For the image classification task, usually, the image collected in the wild contains multiple objects instead of a single dominant one. Besides, the image label is not explicitly associated with the object region, i.e., it is weakly annotated. In this paper, we propose a novel deep convolutional network for image classification under a weakly supervised condition. The proposed method, namely MIDCN, formulate the problem into Multiple Instance Learning (MIL), where each image is a bag which contains multiple instances (objects). Different with previous deep MfL methods which predict the label of each bag (i.e., image) by simply performing pooling/voting strategy over their instance (i.e., region) predictions, MIDCN directly predicts the label of a bag via bag features learned by measuring the similarities between instance features and a set of learned informative prototypes. Specifically, the prototypes are obtained by a newly proposed Global Contrast Pooling (GCP) layer which leverages instances not only coming from the current bag but also the other bags. Thus the learned bag features also contain global information of all the training bags, which is more robust and noise free. We did extensive experiments on two real-world image datasets, including both natural image dataset (PASCAL VOC 07) and pathological lung cancer image dataset, and show the results of the proposed MIDCN consistently outperforms the state-of-the-art methods.
机译:对于图像分类任务,通常,在野外收集的图像包含多个对象,而不是单个主要对象。此外,图像标签没有明确地与对象区域相关联,即,其被弱地注释。在本文中,我们提出了一种在弱监督条件下用于图像分类的新型深度卷积网络。所提出的方法,即MIDCN,将问题表达为多实例学习(MIL),其中每个图像都是一个包含多个实例(对象)的袋子。与以前的深层MfL方法不同,MIDCN通过简单地通过对其实例(即区域)预测执行合并/投票策略来预测每个袋子的标签(即图像),而MIDCN则通过测量包装袋特征来直接预测袋子的标签。实例特征和一组学习到的信息性原型之间的相似性。具体而言,原型是通过新提议的全局对比度池(GCP)层获得的,该层不仅利用了来自当前袋子的实例,还利用了其他袋子的实例。因此,学习袋特征还包含所有训练袋的全局信息,这更加健壮且无噪音。我们对两个真实世界的图像数据集进行了广泛的实验,包括自然图像数据集(PASCAL VOC 07)和病理性肺癌图像数据集,并显示了提出的MIDCN的结果始终优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号