首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation
【24h】

Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation

机译:重温扩张卷积:一种用于弱监督和半监督语义分割的简单方法

获取原文

摘要

Despite the remarkable progress, weakly supervised segmentation approaches are still inferior to their fully supervised counterparts. We obverse the performance gap mainly comes from their limitation on learning to produce high-quality dense object localization maps from image-level supervision. To mitigate such a gap, we revisit the dilated convolution [1] and reveal how it can be utilized in a novel way to effectively overcome this critical limitation of weakly supervised segmentation approaches. Specifically, we find that varying dilation rates can effectively enlarge the receptive fields of convolutional kernels and more importantly transfer the surrounding discriminative information to non-discriminative object regions, promoting the emergence of these regions in the object localization maps. Then, we design a generic classification network equipped with convolutional blocks of different dilated rates. It can produce dense and reliable object localization maps and effectively benefit both weakly- and semi- supervised semantic segmentation. Despite the apparent simplicity, our proposed approach obtains superior performance over state-of-the-arts. In particular, it achieves 60.8% and 67.6% mIoU scores on Pascal VOC 2012 test set in weakly- (only image-level labels are available) and semi- (1,464 segmentation masks are available) supervised settings, which are the new state-of-the-arts.
机译:尽管取得了显着进展,但弱监督的细分方法仍然不如完全监督的同类方法。我们发现性能差距主要来自它们在学习从图像级监督生成高质量密集对象定位图方面的局限性。为了减轻这种差距,我们重新研究了膨胀卷积[1],并揭示了如何以新颖的方式利用它来有效克服弱监督分割方法的这一关键局限性。具体来说,我们发现,不同的膨胀率可以有效地扩大卷积核的接受范围,更重要的是将周围的区分信息转移到非歧视性对象区域,从而促进这些区域在对象定位图中的出现。然后,我们设计了一个通用分类网络,该分类网络配备了不同膨胀率的卷积块。它可以产生密集且可靠的对象定位图,并有效地使弱监督和半监督语义分割受益。尽管表面上看起来很简单,但我们提出的方法仍能获得优于最新技术的性能。特别是,在弱(仅图像级标签可用)和半监督(1,464个细分蒙版)监督设置下,Pascal VOC 2012测试集上的mIoU分数分别达到60.8%和67.6%,这是新的状态。 -艺术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号