...
首页> 外文期刊>Neurocomputing >Interactions Guided Generative Adversarial Network for unsupervised image captioning
【24h】

Interactions Guided Generative Adversarial Network for unsupervised image captioning

机译:用于无监督图像标题的相互作用导向生成对抗网络

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Most of the current image captioning models that have achieved great successes heavily depend on manually labeled image-caption pairs. However, it is expensive and time-consuming to acquire large scale paired data. In this paper, we propose the Interactions Guided Generative Adversarial Network (IGGAN) for unsupervised image captioning, which joints multi-scale feature representation and object-object interactions. To get robust feature representation, the image is encoded by ResNet with a new Multi-scale module and adaptive Channel attention (RMCNet). Moreover, the information on object-object interactions is extracted by our Mutual Attention Network (MAN) and then adopted in the process of adversarial generation, which enhances the rationality of generated sentences. To encourage the sentence to be semantically consistent with the image, we utilize the image and generated sentence to reconstruct each other by cycle consistency in IGGAN. Our proposed model can generate sentences without any manually labeled image-caption pairs. Experimental results show that our proposed model achieves quite promising performance on the MSCOCO image captioning dataset. The ablation studies validate the effectiveness of our proposed modules. (C) 2020 Elsevier B.V. All rights reserved.
机译:大多数目前的图像标题模型都取得了巨大成功,依赖于手动标记的图像标题对。然而,获取大规模配对数据昂贵且耗时。在本文中,我们提出了针对无监督图像标题的相互作用导向生成的对抗网络(IGGAN),其关节多尺度特征表示和对象对象交互。为了获得强大的特征表示,图像由Reset编码,具有新的多尺度模块和自适应信道注意(RMCNet)。此外,关于对象对象交互的信息由我们的相互关注网络(人)提取,然后在对抗的过程中采用,从而提高了所生成的句子的合理性。为了鼓励句子与图像进行语义一致,我们利用图像和生成的句子通过IGGAN中的循环一致性来互相重建。我们所提出的模型可以生成句子而没有任何手动标记的图像标题对。实验结果表明,我们提出的模型在MSCOCO图像标题数据集上实现了非常有希望的性能。消融研究验证了我们提出的模块的有效性。 (c)2020 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号