Photographic Text-to-Image Synthesis with a Hierarchically-Nested Adversarial Network

机译：分层嵌套对抗网络的摄影文本到图像合成

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a novel method to deal with the challenging task of generating photographic images conditioned on semantic image descriptions. Our method introduces accompanying hierarchical-nested adversarial objectives inside the network hierarchies, which regularize mid-level representations and assist generator training to capture the complex image statistics. We present an extensile single-stream generator architecture to better adapt the jointed discriminators and push generated images up to high resolutions. We adopt a multi-purpose adversarial loss to encourage more effective image and text information usage in order to improve the semantic consistency and image fidelity simultaneously. Furthermore, we introduce a new visual-semantic similarity measure to evaluate the semantic consistency of generated images. With extensive experimental validation on three public datasets, our method significantly improves previous state of the arts on all datasets over different evaluation metrics.

机译：本文提出了一种新颖的方法来处理以语义图像描述为条件的生成摄影图像的艰巨任务。我们的方法在网络层次结构内引入了伴随的层次结构嵌套的对抗目标，该目标规则化了中层表示并协助生成器训练以捕获复杂的图像统计信息。我们提出了一种可扩展的单流生成器体系结构，以更好地适应联合的鉴别器，并将生成的图像推高至高分辨率。我们采取了一种多用途对抗性攻击来鼓励更有效地使用图像和文本信息，以同时提高语义一致性和图像保真度。此外，我们引入了一种新的视觉语义相似性度量来评估生成图像的语义一致性。通过对三个公共数据集进行广泛的实验验证，我们的方法通过不同的评估指标显着改善了所有数据集的现有技术水平。

著录项

来源
《IEEE/CVF Conference on Computer Vision and Pattern Recognition》|2018年|6199-6208|共10页
会议地点 Salt Lake City(US)
作者
Zizhao Zhang; Yuanpu Xie; Lin Yang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Generators; Gallium nitride; Training; Image resolution; Task analysis; Semantics; Measurement;

机译：发电机；氮化镓训练;图像分辨率；任务分析；语义学测量;
入库时间 2022-08-26 14:35:28

相似文献

外文文献
中文文献
专利

1. MRP-GAN: Multi-resolution parallel generative adversarial networks for text-to-image synthesis [J] . Qi Zhongjian, Fan Chaogang, Xu Liangfeng, Pattern recognition letters . 2021,第Jula期

机译：MRP-GaN：用于文本到图像合成的多分辨率并行生成对抗网络
2. KT-GAN: Knowledge-Transfer Generative Adversarial Network for Text-to-Image Synthesis [J] . Hongchen Tan, Xiuping Liu, Meng Liu, IEEE Transactions on Image Processing . 2021,第1期

机译：KT-GaN：知识转移生成对抗网络，用于文本到图像合成
3. A survey on generative adversarial network-based text-to-image synthesis [J] . Zhou Rui, Jiang Cong, Xu Qingyang Neurocomputing . 2021,第Sepa3期

机译：基于生成的对抗网络文本到图像合成调查
4. Photographic Text-to-Image Synthesis with a Hierarchically-Nested Adversarial Network [C] . Zizhao Zhang, Yuanpu Xie, Lin Yang IEEE/CVF Conference on Computer Vision and Pattern Recognition . 2018

机译：使用分层嵌套的对冲网络进行摄影文本到图像合成
5. Generative Adversarial Networks for Image Synthesis [D] . Zhang, Han. 2019

机译：图像合成的对抗网络
6. Image synthesis of monoenergetic CT image in dual‐energy CT using kilovoltage CT with deep convolutional generative adversarial networks [O] . Daisuke Kawahara, Shuichi Ozawa, Tomoki Kimura, 2021

机译：用深卷积生成对抗网络使用千伏CT的双能CT中单能仪CT图像的图像合成
7. VocGAN: A High-Fidelity Real-Time Vocoder with a Hierarchically-Nested Adversarial Network [O] . Jinhyeok Yang, Junmo Lee, Youngik Kim, 2020

机译：损益：具有分层嵌套的对冲网络的高保真实时声码器

Photographic Text-to-Image Synthesis with a Hierarchically-Nested Adversarial Network

摘要

著录项

相似文献

相关主题

期刊订阅