Journal: Neurocomputing

GAN-Based virtual-to-real image translation for urban scene semantic segmentation


Abstract

Semantic image segmentation requires large amounts of pixel-wise labeled training data. Creating such data generally requires labor-intensive manual annotation. Extracting training data from video games is therefore a practical idea, since pixel-wise annotation can be automated from video games with near-perfect accuracy. However, experiments show that models trained on raw video-game data cannot be directly applied to real-world scenes because of the domain shift problem. In this paper, we propose a domain-adaptive network based on CycleGAN that translates scenes from a virtual domain to a real domain in both the pixel and feature spaces. Our contributions are threefold: 1) we propose a dynamic perceptual network to improve the quality of the generated images in the feature spaces, making the translated images more conducive to semantic segmentation; 2) we introduce a novel weighted self-regularization loss to prevent semantic changes in translated images; and 3) we design a discrimination mechanism to coordinate multiple subnetworks and improve overall training efficiency. We devise a series of metrics to evaluate the quality of translated images in our experiments on the public GTA-V (a video game dataset, i.e., the virtual domain) and Cityscapes (a real-world dataset, i.e., the real domain) datasets, and we achieve notably improved results, demonstrating the efficacy of the proposed model. (C) 2019 Elsevier B.V. All rights reserved.
