TextCycleGAN: Cyclical-Generative Adversarial Networks for Image Captioning

机译：TextCyclegan：用于图像标题的周期性对抗性网络

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this study, we approach the problem of image captioning with cycle consistent generative adversarial networks (CycleGANs). Due to CycleGANs' ability to learn functions to map between multiple domains and use duality to strengthen each individual mapping with the usage of a cycle consistency loss, these models show great promise in their ability to learn both image captioning and image synthesis and to create a better image captioning framework. Historically, cycle consistency loss was based on the premise that the input should undergo little to no change when mapped to another domain and then back to its original; however, image captioning presents a unique challenge to this concept due to the many-to-many nature of the mapping from images to captions and vice-versa. TextCycleGAN overcomes this obstacle through utilization of cycle consistency in the feature space and is, thereby, able to perform well on both image captioning and synthesis. We will demonstrate its capability as an image captioning framework and discuss how its model architecture makes this possible.

机译：在这项研究中，我们接近循环一致生成的对抗网络（自行车）的图像标题的问题。由于Cractgans的能力学习功能在多个域之间映射并使用二元性来加强每个单独的映射，通过使用周期一致性损失，这些模型在他们学习图像标题和图像合成的能力中表现出很大的承诺并创建一个更好的图像标题框架。从历史上看，循环一致性损失是基于前提，即当映射到另一个领域时，输入应该几乎没有变化，然后返回原件;然而，由于从图像到标题的映射的多对多性质，图像字幕对该概念提出了独特的挑战，并反对。 TextCycleGan通过利用特征空间中的循环一致性克服了该障碍，从而能够在图像标题和合成中表现良好。我们将展示其作为图像标题框架的能力，并讨论其模型架构如何使其成为可能。

著录项

来源
《Conference on Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications》|2021年|117460Z.1-117460Z.8|共8页
会议地点
作者
Mohammad R. Alam; Nicole A. Isoda; Mitch C. Manzanares; Anthony C. Delgado; Antonius F.Panggabean;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Image Captioning; Computer Vision; GAN; Image Synthesis; CycleGAN; Attention;

机译：图像标题;计算机视觉;甘;图像合成;compygan;注意力;

相似文献

外文文献
中文文献
专利

1. Multi-Attention Generative Adversarial Network for image captioning [J] . Neurocomputing . 2020,第Apra28期

机译：用于图像字幕的多注意力生成对抗网络
2. Interactions Guided Generative Adversarial Network for unsupervised image captioning [J] . Cao Shan, An Gaoyun, Zheng Zhenxing, Neurocomputing . 2020,第Deca5期

机译：用于无监督图像标题的相互作用导向生成对抗网络
3. Adversarial inpainting of MR images using deep adversarial networks [J] . K.Armanious, T.Küstner, B.Yang, Magma: Magnetic resonance materials in physics, biology, and medicine . 2019,第1Suppla1期

机译：利用深对抗网络对MR图像的对抗侵犯
4. A Novel Image Captioning Method Based on Generative Adversarial Networks [C] . Yang Fan, Jungang Xu, Yingfei Sun, International Conference on Artificial Neural Networks . 2019

机译：基于生成对抗网络的图像字幕新方法
5. Ensemble Learning on Deep Neural Networks for Image Caption Generation [D] . Katpally, Harshitha 2019

机译：在深度神经网络上进行集成学习以生成图像字幕
6. Deep learning approach to classification of lung cytological images: Two-step training using actual and synthesized images by progressive growing of generative adversarial networks [O] . Atsushi Teramoto, Tetsuya Tsukamoto, Ayumi Yamada, 2020

机译：深层学习方法肺细胞学图像分类：使用实际和合成图像的两步训练通过逐步生长的生长生长育种网络
7. BraIN: A Bidirectional Generative Adversarial Networks for image captions [O] . Yuhui Wang, Diane Cook 2020

机译：脑：用于图像标题的双向生成对抗网络

TextCycleGAN: Cyclical-Generative Adversarial Networks for Image Captioning

摘要

著录项

相似文献

相关主题

期刊订阅