
Image generation from bounding box-represented semantic labels



Abstract

Image generation with pixel-wise semantic information is suitable for the development of adversarial learning techniques. In this study, we propose a method for synthesizing objects with class-specific textures and fine-scale details based on bounding box-represented semantic labels. To achieve this goal, we note that the traditional generative adversarial network (GAN) uses noise as an input to generate realistic images with sufficient textures and details, but it cannot be guided by specific targets and requirements. By contrast, a conditional GAN (cGAN) can incorporate various types of guiding information, but it often ignores specific textures and details, thereby producing less realistic, lower-resolution results. Thus, we propose a new translator-enhancer framework that combines cGAN and GAN to achieve high-quality image generation. The cGAN is used as a translator to match the semantic constraints, whereas the GAN is employed as an enhancer to provide details and textures. We also propose a new form of semantic label map as an input, which is represented by instance-level bounding boxes rather than segmentation masks. A semantic label map represented by bounding boxes makes it easier for users to provide the inputs and also gives greater flexibility when generating object boundaries. The results obtained from qualitative and quantitative experiments showed that our method can generate realistic images of objects with semantic labels represented by bounding boxes. Our method can be used to generate images of novel scenes to support learning tasks during training with various scenes, which are difficult to capture in the real world. (C) 2019 Elsevier Ltd. All rights reserved.
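The input representation the abstract describes can be illustrated with a short sketch: rasterizing instance-level bounding boxes into a one-hot semantic label map, which would then be fed to the translator network. This is a hypothetical helper based only on the abstract's description, not the authors' code; the tuple layout `(class_id, x0, y0, x1, y1)` and the overwrite rule for overlapping instances are assumptions.

```python
import numpy as np

def boxes_to_label_map(boxes, num_classes, height, width):
    """Rasterize instance-level bounding boxes into a one-hot semantic
    label map of shape (H, W, C).

    `boxes` is a list of (class_id, x0, y0, x1, y1) tuples in pixel
    coordinates. Where boxes overlap, later instances overwrite earlier
    ones (an assumed convention; the paper may resolve overlaps differently).
    """
    label = np.zeros((height, width, num_classes), dtype=np.float32)
    for cls, x0, y0, x1, y1 in boxes:
        label[y0:y1, x0:x1, :] = 0.0   # clear earlier classes in the overlap
        label[y0:y1, x0:x1, cls] = 1.0  # mark this instance's class channel
    return label

# e.g. a 64x64 map with a class-1 box and a partially overlapping class-2 box
lm = boxes_to_label_map([(1, 4, 4, 30, 20), (2, 25, 10, 40, 40)], 3, 64, 64)
```

Compared with segmentation masks, such a map is cheap for a user to specify (five numbers per object), which matches the abstract's claim that bounding-box inputs are easier to provide and leave the network free to invent plausible object boundaries.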

Record details

  • Source
    Computers & Graphics | 2019, Issue 6 | pp. 32-40 | 9 pages
  • Author affiliations

    Tsinghua Univ, BNRist, Beijing 100084, Peoples R China | Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China;

    Yi Tunnel Technol Co Ltd, Beijing 100084, Peoples R China;

    Tsinghua Univ, BNRist, Beijing 100084, Peoples R China | Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China;

    Tsinghua Univ, BNRist, Beijing 100084, Peoples R China | Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China;

  • Indexed in: Science Citation Index (SCI); Engineering Index (EI)
  • Original format: PDF
  • Language: English (eng)
  • Chinese Library Classification
  • Keywords

    Computer vision; Generative adversarial networks; Image generation;

