首页> 外文会议> >Hierarchically-Fused Generative Adversarial Network for Text to Realistic Image Synthesis
【24h】

Hierarchically-Fused Generative Adversarial Network for Text to Realistic Image Synthesis

机译:用于文本到逼真的图像合成的分层融合生成对抗网络

获取原文

摘要

In this paper, we present a novel Hierarchically-fused Generative Adversarial Network (HfGAN) for synthesizing realistic images from text descriptions. While existing approaches on this topic have achieved impressive success, to generate 256×256 images from captions, they commonly resort to coarse-to-fine scheme and associate multiple discriminators in different stages of the networks. Such a strategy is both inefficient and prone to artifacts. Motivated by the above findings, we propose an end-to-end network that can generate 256×256 photo-realistic images with only one discriminator. We fully exploit the hierarchical information from different layers and directly generate the fine-scale images by adaptively fusing features from multi-hierarchical layers. We quantitatively evaluate the synthesized images with Inception Score, Visual-semantic Similarity and average training time on the CUB birds, Oxford-102 flowers, and COCO datasets. The results show that our model is more efficient and noticeably outperforms the previous state-of-the-art methods.
机译:在本文中,我们提出了一种新颖的层次融合生成对抗网络(HfGAN),用于从文本描述中合成逼真的图像。尽管有关此主题的现有方法取得了令人瞩目的成功,但可以从字幕生成256×256的图像,但它们通常诉诸于精细到精细的方案,并在网络的不同阶段关联了多个鉴别器。这样的策略效率低下并且容易产生伪像。基于以上发现,我们提出了一种端到端网络,该网络可以仅使用一个鉴别器就可以生成256×256的逼真图像。我们充分利用了来自不同层的层次信息,并通过自适应融合来自多层次层的特征来直接生成精细图像。我们对CUB鸟类,Oxford-102花朵和COCO数据集的初始得分,视觉语义相似度和平均训练时间进行定量评估,得出合成图像。结果表明,我们的模型效率更高,并且明显优于以前的最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号