首页> 外文会议>Conference on Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications >TextCycleGAN: Cyclical-Generative Adversarial Networks for Image Captioning
【24h】

TextCycleGAN: Cyclical-Generative Adversarial Networks for Image Captioning

机译:TextCyclegan:用于图像标题的周期性对抗性网络

获取原文

摘要

In this study, we approach the problem of image captioning with cycle consistent generative adversarial networks (CycleGANs). Due to CycleGANs' ability to learn functions to map between multiple domains and use duality to strengthen each individual mapping with the usage of a cycle consistency loss, these models show great promise in their ability to learn both image captioning and image synthesis and to create a better image captioning framework. Historically, cycle consistency loss was based on the premise that the input should undergo little to no change when mapped to another domain and then back to its original; however, image captioning presents a unique challenge to this concept due to the many-to-many nature of the mapping from images to captions and vice-versa. TextCycleGAN overcomes this obstacle through utilization of cycle consistency in the feature space and is, thereby, able to perform well on both image captioning and synthesis. We will demonstrate its capability as an image captioning framework and discuss how its model architecture makes this possible.
机译:在这项研究中,我们接近循环一致生成的对抗网络(自行车)的图像标题的问题。由于Cractgans的能力学习功能在多个域之间映射并使用二元性来加强每个单独的映射,通过使用周期一致性损失,这些模型在他们学习图像标题和图像合成的能力中表现出很大的承诺并创建一个更好的图像标题框架。从历史上看,循环一致性损失是基于前提,即当映射到另一个领域时,输入应该几乎没有变化,然后返回原件;然而,由于从图像到标题的映射的多对多性质,图像字幕对该概念提出了独特的挑战,并反对。 TextCycleGan通过利用特征空间中的循环一致性克服了该障碍,从而能够在图像标题和合成中表现良好。我们将展示其作为图像标题框架的能力,并讨论其模型架构如何使其成为可能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号