
Fully Convolutional Adaptation Networks for Semantic Segmentation


Abstract

The recent advances in deep neural networks have convincingly demonstrated high capability in learning vision models on large datasets. Nevertheless, collecting expert-labeled datasets, especially with pixel-level annotations, is an extremely expensive process. An appealing alternative is to render synthetic data (e.g., from computer games) and generate ground truth automatically. However, simply applying models learnt on synthetic images may lead to high generalization error on real images due to domain shift. In this paper, we address this issue from the perspectives of both visual appearance-level and representation-level domain adaptation. The former adapts source-domain images to appear as if drawn from the 'style' of the target domain, and the latter attempts to learn domain-invariant representations. Specifically, we present Fully Convolutional Adaptation Networks (FCAN), a novel deep architecture for semantic segmentation which combines Appearance Adaptation Networks (AAN) and Representation Adaptation Networks (RAN). AAN learns a transformation from one domain to the other in pixel space, while RAN is optimized in an adversarial manner to maximally fool a domain discriminator with the learnt source and target representations. Extensive experiments are conducted on the transfer from GTA5 (game videos) to Cityscapes (urban street scenes) for semantic segmentation, and our proposal achieves superior results compared to state-of-the-art unsupervised adaptation techniques. More remarkably, we obtain a new record: an mIoU of 47.5% on BDDS (drive-cam videos) in an unsupervised setting.
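To make the representation-level adaptation described in the abstract concrete, below is a minimal PyTorch sketch of the adversarial idea behind RAN: the segmentation network is trained with supervision on source images only, while a domain discriminator is trained to tell source features from target features and the segmentation network is simultaneously pushed to fool it. The network interface (`seg_net` returning a feature map and per-pixel logits), channel sizes, and the loss weight `lambda_adv` are illustrative assumptions, not the authors' released implementation.

```python
# Hedged sketch of adversarial representation adaptation (the RAN component).
# Assumes seg_net(images) -> (features, per-pixel class logits); all sizes are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F


class DomainDiscriminator(nn.Module):
    """Predicts, per spatial location, whether features come from the source or target domain."""

    def __init__(self, in_channels=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_channels, 128, kernel_size=3, stride=2, padding=1),
            nn.LeakyReLU(0.2, inplace=True),
            nn.Conv2d(128, 64, kernel_size=3, stride=2, padding=1),
            nn.LeakyReLU(0.2, inplace=True),
            nn.Conv2d(64, 1, kernel_size=3, stride=1, padding=1),  # domain logit map
        )

    def forward(self, feat):
        return self.net(feat)


def adaptation_step(seg_net, discriminator, opt_seg, opt_disc,
                    src_images, src_labels, tgt_images, lambda_adv=0.001):
    """One training step: supervised segmentation loss on source images plus an
    adversarial term that makes target features indistinguishable from source features."""
    # ---- update the segmentation network ----
    opt_seg.zero_grad()
    src_feat, src_logits = seg_net(src_images)
    seg_loss = F.cross_entropy(src_logits, src_labels)  # supervision only on the source domain

    tgt_feat, _ = seg_net(tgt_images)
    tgt_domain = discriminator(tgt_feat)
    # Fool the discriminator: push target features toward the "source" label (1).
    adv_loss = F.binary_cross_entropy_with_logits(
        tgt_domain, torch.ones_like(tgt_domain))
    (seg_loss + lambda_adv * adv_loss).backward()
    opt_seg.step()

    # ---- update the domain discriminator ----
    opt_disc.zero_grad()
    src_domain = discriminator(src_feat.detach())
    tgt_domain = discriminator(tgt_feat.detach())
    disc_loss = 0.5 * (
        F.binary_cross_entropy_with_logits(src_domain, torch.ones_like(src_domain)) +
        F.binary_cross_entropy_with_logits(tgt_domain, torch.zeros_like(tgt_domain)))
    disc_loss.backward()
    opt_disc.step()
    return seg_loss.item(), adv_loss.item(), disc_loss.item()
```

In this sketch the appearance-level step (AAN, which restyles source images in pixel space) is assumed to have been applied to `src_images` beforehand; only the feature-level adversarial game is shown.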
