首页> 外文期刊>Information Processing & Management >Adversarial network integrating dual attention and sparse representation for semi-supervised semantic segmentation
【24h】

Adversarial network integrating dual attention and sparse representation for semi-supervised semantic segmentation

机译:对抗半监督语义分割的反对网络集成了双重关注和稀疏表示

获取原文
获取原文并翻译 | 示例
       

摘要

Semantic segmentation is the important task of assigning a semantic label to each pixel. However, semantic segmentation based on the deep neural network usually requires massive annotations consumption to acquire better performance. To avoid the problem, some algorithms based on weakly-supervised and semi-supervised conditions have been proposed and achieved gradually improving performance in recent years. In this paper, we propose a novel semi-supervised adversarial network to alleviate the shortage of labeled data, which only requires a few labeled images to get competitive performance. The model is composed of two parts: the segmentation network and the discriminator network. The first part aims to semantically generate a segmented result that has the same size as the input color image. The discriminator network is designed in a fully convolutional manner to distinguish the predicted probability maps depending on the ground truth distribution. In particular, the probability maps are regarded as focal attention maps, which are fed back to the segmentation network to make the model converge faster, and the process can induce the model to focus on pixels that are hard to segment. To enhance the representation ability of image features, sparse representation and dual attention are adopted in the segmentation network. The sparse representation module aims to emphasize the object edges and locations by learning the convolutional sparse representation of the input color images, and the dual attention module can exploit the semantic interdependencies in two different dimensions. Moreover, the semi-supervised mechanism is introduced to the network, in which the adaptive parameter T that controls the sensitivity of the self-taught phase is proposed, and the training dataset is split into two parts for fully-supervised learning and semi-supervised learning. Specifically, the first part is unlabeled data, which is applied to provide supervised signals for semi-supervised training. The labeled data drawn from the other part is utilized for fully-supervised learning. Our semi-supervised adversarial framework can improve the learning ability and achieve higher performance, also, provide a novel approach to tackle the semantic segmentation task. Finally, comprehensive experiments on the PASCAL VOC 2012 and Cityscapes datasets are conducted to verify the effectiveness of the proposed model, which achieves the expected performance.
机译:语义分割是将语义标签分配给每个像素的重要任务。然而,基于深度神经网络的语义分割通常需要巨大的注释消耗来获得更好的性能。为了避免问题,已经提出了一些基于弱监督和半监督条件的算法,并逐步提高了近年来的性能。在本文中,我们提出了一种新型半监督的对抗性网络来缓解标记数据的短缺,这只需要一些标记的图像来获得竞争性能。该模型由两部分组成:分段网络和鉴别器网络。第一部分旨在语义生成与输入颜色图像相同的分段结果。鉴别器网络以完全卷积的方式设计,以根据地面事实分布区分预测的概率图。特别地,概率图被视为焦虑映射,其被反馈到分割网络以使模型更快地收敛,并且该过程可以诱导模型聚焦难以段的像素。为了增强图像特征的表示能力,分段网络采用稀疏表示和双重关注。稀疏表示模块旨在通过学习输入彩色图像的卷积稀疏表示来强调对象边缘和位置,并且双重注意模块可以利用两个不同的尺寸来利用语义相互依赖性。此外,提出了半监督机制,其中提出了控制自学阶段灵敏度的自适应参数T,并且训练数据集分为两个部分以进行全监督的学习和半监督学习。具体地,第一部分是未标记的数据,其应用于为半监督培训提供监督信号。从另一部分绘制的标记数据用于完全监督的学习。我们的半监督对冲框架可以提高学习能力,实现更高的性能,也提供了一种解决语义细分任务的新方法。最后,对Pascal VOC 2012和Citycapes数据集进行了全面的实验,以验证所提出的模型的有效性,从而实现了预期的性能。

著录项

  • 来源
    《Information Processing & Management》 |2021年第5期|102680.1-102680.19|共19页
  • 作者

    Ge Jin; Chuancai Liu; Xu Chen;

  • 作者单位

    School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing 210094 China;

    School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing 210094 China Collaborative Innovation Center of IoT Technology and Intelligent Systems Minjiang University Fuzhou 350108 China;

    School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing 210094 China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Semi-supervised learning; Focal attention map; Dual attention; Sparse representation; Adversarial network;

    机译:半监督学习;焦点地图;双重关注;稀疏表示;对抗网络;
  • 入库时间 2022-08-19 02:25:57

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号