Hierarchically-Fused Generative Adversarial Network for Text to Realistic Image Synthesis

Abstract

In this paper, we present a novel Hierarchically-fused Generative Adversarial Network (HfGAN) for synthesizing realistic images from text descriptions. While existing approaches to this topic have achieved impressive success, to generate 256×256 images from captions they commonly resort to a coarse-to-fine scheme and attach multiple discriminators at different stages of the networks. Such a strategy is both inefficient and prone to artifacts. Motivated by these findings, we propose an end-to-end network that can generate 256×256 photo-realistic images with only one discriminator. We fully exploit the hierarchical information from different layers and directly generate the fine-scale images by adaptively fusing features from multi-hierarchical layers. We quantitatively evaluate the synthesized images with Inception Score, Visual-semantic Similarity, and average training time on the CUB birds, Oxford-102 flowers, and COCO datasets. The results show that our model is more efficient and noticeably outperforms the previous state-of-the-art methods.
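The abstract names Inception Score as one of its evaluation metrics. As a rough illustration only, the sketch below computes the standard score, exp(E_x[KL(p(y|x) || p(y))]), from a list of per-image class probabilities. The function name `inception_score` and the toy inputs are assumptions made for this example; in practice the probabilities come from a pretrained Inception classifier applied to the generated images, and the paper's exact evaluation protocol is not described on this page.

```python
import math


def inception_score(probs, eps=1e-12):
    """Compute exp(mean KL(p(y|x) || p(y))) from per-image class probabilities.

    probs: list of lists; probs[i][k] is p(y=k | x_i), each row summing to 1.
    This is a hypothetical helper for illustration, not the authors' code.
    """
    n = len(probs)
    num_classes = len(probs[0])
    # Marginal class distribution p(y), averaged over all images.
    marginal = [sum(p[k] for p in probs) / n for k in range(num_classes)]
    # Average the per-image KL divergence KL(p(y|x) || p(y)).
    mean_kl = 0.0
    for p in probs:
        kl = sum(
            pk * (math.log(pk + eps) - math.log(mk + eps))
            for pk, mk in zip(p, marginal)
            if pk > 0
        )
        mean_kl += kl / n
    return math.exp(mean_kl)


# Toy inputs (assumed): confident and diverse predictions vs. uninformative ones.
confident = [[1.0, 0.0], [0.0, 1.0]]
uninformative = [[0.5, 0.5], [0.5, 0.5]]
print(inception_score(confident))
print(inception_score(uninformative))
```

A higher score indicates predictions that are confident for each image yet varied across images; the score is bounded above by the number of classes and below by 1.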

Bibliographic details

Source
2019 | pp. 73-80 | 8 pages
Authors
Xin Huang; Mingjie Wang; Minglun Gong;
Original format: PDF
Keywords
Generative adversarial networks; Generators; Feature extraction; Training; Image resolution; Image synthesis; Birds;

Similar literature

1. Text to photo-realistic image synthesis via chained deep recurrent generative adversarial network [J]. Wang M., Lang C., Feng S. Journal of visual communication & image representation. 2021, Issue Jana
2. A Realistic Image Generation of Face From Text Description Using the Fully Trained Generative Adversarial Networks [J]. Muhammad Zeeshan Khan, Saira Jabeen, Muhammad Usman Ghani Khan. Quality Control, Transactions. 2021, Issue 1
3. Attentional Generative Adversarial Networks With Representativeness and Diversity for Generating Text to Realistic Image [J]. Tian Anjie, Lu Lu. Quality Control, Transactions. 2020
4. Hierarchically-Fused Generative Adversarial Network for Text to Realistic Image Synthesis [C]. Xin Huang, Mingjie Wang, Minglun Gong. Conference on Computer and Robot Vision. 2019
5. Generative Adversarial Networks for Image Synthesis [D]. Zhang, Han. 2019
6. Generative Adversarial Networks for the Creation of Realistic Artificial Brain Magnetic Resonance Images [O]. Koshino Kazuhiro, Rudolf A. Werner, Fujio Toriumi. 2018
7. StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks [O]. Zhang, Han; Xu, Tao; Li, Hongsheng. 2017
