首页> 外文会议>Asian Conference on Computer Vision >Paired-D GAN for Semantic Image Synthesis
【24h】

Paired-D GAN for Semantic Image Synthesis

机译:配对D GAN用于语义图像合成

获取原文

摘要

Semantic image synthesis is to render foreground (object) given as a text description into a given source image. This has a wide range of applications such as intelligent image manipulation, and is helpful to those who are not good at painting. We propose a generative adversarial network having a pair of discriminators with different architectures, called Paired-D CAN, for semantic image synthesis where the two discriminators make different judgments: one for foreground synthesis and the other for background synthesis. The generator of paired-D GAN has the encoder-decoder architecture with skip-connections and synthesizes an image matching the given text description while preserving other parts of the source image. The two discriminators judge foreground and background of the synthesized image separately to meet an input text description and a source image. The paired-D CAN is trained using the effective adversarial learning process in a simultaneous three-player minimax game. Experimental results on the Caltech-200 bird dataset and the Oxford-102 flower dataset show that Paired-GAN is capable of semantically synthesizing images to match an input text description while retaining the background in a source image against the state-of-the-art methods.
机译:语义图像合成是将作为文本描述给出的前景(对象)渲染到给定的源图像中。这具有广泛的应用,例如智能图像处理,并且对那些不擅长绘画的人有帮助。我们提出了一个生成对抗网络,该网络具有一对具有不同体系结构的鉴别器,称为Paired-D CAN,用于语义图像合成,其中两个鉴别器做出不同的判断:一个用于前景合成,另一个用于背景合成。配对D GAN的生成器具有带跳过连接的编码器-解码器体系结构,并在保留源图像其他部分的同时,合成与给定文本描述匹配的图像。这两个鉴别器分别判断合成图像的前景和背景,以满足输入文本描述和源图像。配对的D CAN可以在三人同时进行的minimax游戏中使用有效的对抗性学习过程进行训练。在Caltech-200鸟类数据集和Oxford-102花卉数据集上的实验结果表明,Paired-GAN能够语义合成图像以匹配输入文本描述,同时将背景图像保留在最新图像中方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号