Paired-D GAN for Semantic Image Synthesis

机译：配对D GAN用于语义图像合成

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Semantic image synthesis is to render foreground (object) given as a text description into a given source image. This has a wide range of applications such as intelligent image manipulation, and is helpful to those who are not good at painting. We propose a generative adversarial network having a pair of discriminators with different architectures, called Paired-D CAN, for semantic image synthesis where the two discriminators make different judgments: one for foreground synthesis and the other for background synthesis. The generator of paired-D GAN has the encoder-decoder architecture with skip-connections and synthesizes an image matching the given text description while preserving other parts of the source image. The two discriminators judge foreground and background of the synthesized image separately to meet an input text description and a source image. The paired-D CAN is trained using the effective adversarial learning process in a simultaneous three-player minimax game. Experimental results on the Caltech-200 bird dataset and the Oxford-102 flower dataset show that Paired-GAN is capable of semantically synthesizing images to match an input text description while retaining the background in a source image against the state-of-the-art methods.

机译：语义图像合成是将作为文本描述给出的前景（对象）渲染到给定的源图像中。这具有广泛的应用，例如智能图像处理，并且对那些不擅长绘画的人有帮助。我们提出了一个生成对抗网络，该网络具有一对具有不同体系结构的鉴别器，称为Paired-D CAN，用于语义图像合成，其中两个鉴别器做出不同的判断：一个用于前景合成，另一个用于背景合成。配对D GAN的生成器具有带跳过连接的编码器-解码器体系结构，并在保留源图像其他部分的同时，合成与给定文本描述匹配的图像。这两个鉴别器分别判断合成图像的前景和背景，以满足输入文本描述和源图像。配对的D CAN可以在三人同时进行的minimax游戏中使用有效的对抗性学习过程进行训练。在Caltech-200鸟类数据集和Oxford-102花卉数据集上的实验结果表明，Paired-GAN能够语义合成图像以匹配输入文本描述，同时将背景图像保留在最新图像中方法。

著录项

来源
《Asian Conference on Computer Vision》|2018年|468-484|共17页
会议地点
作者
Duc Minn Vo; Akihiro Sugimoto;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. FISS GAN: A Generative Adversarial Network for Foggy Image Semantic Segmentation [J] . Kunhua Liu, Zihao Ye, Hongyan Guo, 自动化学报（英文版） . 2021,第008期
2. FISS GAN:A Generative Adversarial Network for Foggy Image Semantic Segmentation [J] . Kunhua Liu, Zihao Ye, Hongyan Guo, 自动化学报：英文版 . 2021,第008期
3. GAN-Based virtual-to-real image translation for urban scene semantic segmentation [J] . Guo Xi, Wang Zhicheng, Yang Qin, Neurocomputing . 2020,第Juna21期

机译：基于GaN的虚拟到实图像翻译城市场景语义细分
4. Paired-D GAN for Semantic Image Synthesis [C] . Duc Minh Vo, Akihiro Sugimoto Asian Conference on Computer Vision . 2019

机译：配对-D GaN用于语义图像合成
5. Generating fMRI Images of Neural Activations Using Ranked Voxel Mask and Volumetric Conditional Gan [D] . Chitroda, Divyesh. 2020

机译：使用排名的体素掩模和体积条件GaN产生神经激活的FMRI图像
6. CT-Based Pelvic T1-Weighted MR Image Synthesis Using UNet UNet++ and Cycle-Consistent Generative Adversarial Network (Cycle-GAN) [O] . Reza Kalantar, Christina Messiou, Jessica M. Winfield, 2021

机译：基于CT的骨盆T1加权MR图像合成使用UNETUNET ++和循环一致的生成对抗网络（CYCLE-GAN）
7. Dual Attention GANs for Semantic Image Synthesis [O] . Hao Tang, Song Bai, Nicu Sebe 2020

机译：对语义图像合成的双重关注GANS
8. Transport Imaging for Contact-Free Measurements of Minority Carrier Diffusion in GaN, GaN/AlGaN, and GaN/InGaN Core-Shell Nanowires. [R] . A. A. Talin C. P. Ong L. Baird N. M. Haegel R. A. Cole 2011

机译：用于GaN，GaN / alGaN和GaN / InGaN核壳纳米线中少数载流子扩散的无接触测量的传输成像。

Paired-D GAN for Semantic Image Synthesis

摘要

著录项

相似文献

相关主题

期刊订阅