首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation
【24h】

Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation

机译:重新审视扩张的卷积:弱和半监督语义细分的简单方法

获取原文

摘要

Despite the remarkable progress, weakly supervised segmentation approaches are still inferior to their fully supervised counterparts. We obverse the performance gap mainly comes from their limitation on learning to produce high-quality dense object localization maps from image-level supervision. To mitigate such a gap, we revisit the dilated convolution [1] and reveal how it can be utilized in a novel way to effectively overcome this critical limitation of weakly supervised segmentation approaches. Specifically, we find that varying dilation rates can effectively enlarge the receptive fields of convolutional kernels and more importantly transfer the surrounding discriminative information to non-discriminative object regions, promoting the emergence of these regions in the object localization maps. Then, we design a generic classification network equipped with convolutional blocks of different dilated rates. It can produce dense and reliable object localization maps and effectively benefit both weakly- and semi- supervised semantic segmentation. Despite the apparent simplicity, our proposed approach obtains superior performance over state-of-the-arts. In particular, it achieves 60.8% and 67.6% mIoU scores on Pascal VOC 2012 test set in weakly- (only image-level labels are available) and semi- (1,464 segmentation masks are available) supervised settings, which are the new state-of-the-arts.
机译:尽管进展显着,但弱势监督的分割方法仍然不如他们的全面监督的对应物。我们正常的性能差距主要来自他们对学习的限制,从图像级监督产生高质量的密集物质定位地图。为了减轻这种差距,我们重新审视扩张的卷积[1]并揭示如何以新颖的方式利用,以有效地克服弱监督分割方法的这种关键限制。具体地,我们发现不同的扩张速率可以有效地扩大卷积核的接受领域,更重要的是将周围的鉴别信息转移到非歧视性对象区域,促进对象定位地图中这些区域的出现。然后,我们设计一个配备不同扩张速率的卷积块的通用分类网络。它可以产生密集和可靠的对象定位地图,有效地利用弱和半监督的语义分割。尽管简单的简单性,但我们所提出的方法可以获得最先进的卓越性能。特别是,它在弱 - (只有图像级标签中有效的Pascal VOC 2012年测试集上达到60.8%和67.6%的Miou分数,并且可以获得半(1,464个分割掩模)监督设置,这是新的状态-艺术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号