首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Photographic Text-to-Image Synthesis with a Hierarchically-Nested Adversarial Network
【24h】

Photographic Text-to-Image Synthesis with a Hierarchically-Nested Adversarial Network

机译:分层嵌套对抗网络的摄影文本到图像合成

获取原文

摘要

This paper presents a novel method to deal with the challenging task of generating photographic images conditioned on semantic image descriptions. Our method introduces accompanying hierarchical-nested adversarial objectives inside the network hierarchies, which regularize mid-level representations and assist generator training to capture the complex image statistics. We present an extensile single-stream generator architecture to better adapt the jointed discriminators and push generated images up to high resolutions. We adopt a multi-purpose adversarial loss to encourage more effective image and text information usage in order to improve the semantic consistency and image fidelity simultaneously. Furthermore, we introduce a new visual-semantic similarity measure to evaluate the semantic consistency of generated images. With extensive experimental validation on three public datasets, our method significantly improves previous state of the arts on all datasets over different evaluation metrics.
机译:本文提出了一种新颖的方法来处理以语义图像描述为条件的生成摄影图像的艰巨任务。我们的方法在网络层次结构内引入了伴随的层次结构嵌套的对抗目标,该目标规则化了中层表示并协助生成器训练以捕获复杂的图像统计信息。我们提出了一种可扩展的单流生成器体系结构,以更好地适应联合的鉴别器,并将生成的图像推高至高分辨率。我们采取了一种多用途对抗性攻击来鼓励更有效地使用图像和文本信息,以同时提高语义一致性和图像保真度。此外,我们引入了一种新的视觉语义相似性度量来评估生成图像的语义一致性。通过对三个公共数据集进行广泛的实验验证,我们的方法通过不同的评估指标显着改善了所有数据集的现有技术水平。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号