首页> 外文期刊>Circuits and Systems for Video Technology, IEEE Transactions on >Bridge-GAN: Interpretable Representation Learning for Text-to-Image Synthesis
【24h】

Bridge-GAN: Interpretable Representation Learning for Text-to-Image Synthesis

机译:Bridge-GaN:文本到图像合成的可解释表示学习

获取原文
获取原文并翻译 | 示例
       

摘要

Text-to-image synthesis is to generate images with the consistent content as the given text description, which is a highly challenging task with two main issues: visual reality and content consistency. Recently, it is available to generate images with high visual reality due to the significant progress of generative adversarial networks. However, translating text description to image with high content consistency is still ambitious. For addressing the above issues, it is reasonable to establish a transitional space with interpretable representation as a bridge to associate text and image. So we propose a text-to-image synthesis approach named Bridge-like Generative Adversarial Networks (Bridge-GAN). Its main contributions are: (1) A transitional space is established as a bridge for improving content consistency, where the interpretable representation can be learned by guaranteeing the key visual information from given text descriptions. (2) A ternary mutual information objective is designed for optimizing the transitional space and enhancing both the visual reality and content consistency. It is proposed under the goal to disentangle the latent factors conditioned on text description for further interpretable representation learning. Comprehensive experiments on two widely-used datasets verify the effectiveness of our Bridge-GAN with the best performance.
机译:文本到图像合成是生成具有一致内容的图像,作为给定的文本描述,这是一个具有两个主要问题的高度具有挑战性的任务:视觉现实内容一致性。最近,由于生成的对抗性网络的显着进展,它可用于产生具有高视觉现实的图像。但是,将文本描述转换为具有高内容一致性的图像仍然雄心勃勃。为了解决上述问题,建立具有可解释表示的过渡空间是合理的,作为联合文本和图像的桥梁。因此,我们提出了一个名为Bridge Denerative Profersarial Networks(Bridge-GaN)的文本到图像综合方法。其主要贡献是:(1)过渡空间是建立为改善的桥梁内容一致性,可以通过保证来自给定文本描述的关键视觉信息来学习可解释的表示。 (2)三元相互信息目标旨在优化过渡空间并增强两个视觉现实内容一致性。在目标下提出了解开潜在因素的潜在因素,以便进一步解释的代表学习。两个广泛使用的数据集的综合实验验证了我们的桥GAN的有效性,具有最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号