首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Learning Deep Features for Discriminative Localization
【24h】

Learning Deep Features for Discriminative Localization

机译:学习深度功能以进行区分式本地化

获取原文

摘要

In this work, we revisit the global average pooling layer proposed in [13], and shed light on how it explicitly enables the convolutional neural network (CNN) to have remarkable localization ability despite being trained on imagelevel labels. While this technique was previously proposed as a means for regularizing training, we find that it actually builds a generic localizable deep representation that exposes the implicit attention of CNNs on an image. Despite the apparent simplicity of global average pooling, we are able to achieve 37.1% top-5 error for object localization on ILSVRC 2014 without training on any bounding box annotation. We demonstrate in a variety of experiments that our network is able to localize the discriminative image regions despite just being trained for solving classification task1.
机译:在这项工作中,我们回顾了[13]中提出的全局平均池化层,并阐明了它如何显式地使卷积神经网络(CNN)具有卓越的定位能力,尽管在图像级标签上进行了训练。尽管以前曾提议将该技术用作正规化训练的一种方法,但我们发现它实际上建立了一个通用的可本地化的深层表示,该深层表示将CNN的隐式注意力暴露在图像上。尽管全局平均池的表面看起来很简单,但我们无需进行任何边界框注释训练,就能够在ILSVRC 2014上实现37.1%的top-5错误进行对象定位。我们在各种实验中证明,尽管我们的网络经过训练以解决分类任务1,但我们的网络仍能够定位可区分的图像区域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号