Image captioning with semantic-enhanced features and extremely hard negative examples

Abstract

Image captioning is the task of generating natural-language descriptions of images. In existing image captioning models, the generated captions usually lack semantic discriminability. Semantic discriminability is difficult to achieve because it requires the model to capture fine-grained differences between images. In this paper, we propose an image captioning framework with semantic-enhanced features and extremely hard negative examples. These two components are combined in a semantic-enhanced module. The module consists of an image-text matching sub-network and a feature fusion layer, which together provide semantic-enhanced features carrying rich semantic information. Moreover, to improve semantic discriminability, we propose an extremely hard negative mining method, which uses extremely hard negative examples to improve the latent alignment between visual and language information. Experimental results on MSCOCO and Flickr30K show that the proposed framework and training method improve the performance of image-text matching and image captioning simultaneously, achieving performance competitive with state-of-the-art methods. (C) 2020 Elsevier B.V. All rights reserved.
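
The abstract does not spell out how the extremely hard negatives are selected, so the following is only a minimal sketch of the general idea behind hard negative mining for image-text matching, not the authors' method. It assumes a common max-of-hinges triplet ranking loss in which, for each image and each caption in a batch, only the most similar mismatched item contributes. The function name `hardest_negative_triplet_loss`, the `margin` value, and the use of cosine similarity are all illustrative assumptions.

```python
import numpy as np


def hardest_negative_triplet_loss(img_emb, txt_emb, margin=0.2):
    """Hypothetical max-of-hinges ranking loss over matched image/caption pairs.

    img_emb, txt_emb: arrays of shape (n, d); row i of each is a matched pair.
    """
    # L2-normalise so that the dot product is the cosine similarity.
    img = img_emb / np.linalg.norm(img_emb, axis=1, keepdims=True)
    txt = txt_emb / np.linalg.norm(txt_emb, axis=1, keepdims=True)
    sim = img @ txt.T            # sim[i, j] = similarity(image i, caption j)
    pos = np.diag(sim)           # similarities of the matched pairs

    # Hinge cost of every mismatched combination, in both retrieval directions.
    cost_txt = np.maximum(0.0, margin + sim - pos[:, None])  # caption j vs. image i
    cost_img = np.maximum(0.0, margin + sim - pos[None, :])  # image i vs. caption j

    # Matched pairs are not negatives, so they contribute nothing.
    mask = np.eye(sim.shape[0], dtype=bool)
    cost_txt[mask] = 0.0
    cost_img[mask] = 0.0

    # Keep only the hardest negative per image (row max) and per caption (column max).
    return cost_txt.max(axis=1).sum() + cost_img.max(axis=0).sum()
```

With perfectly separated embeddings the loss is zero, while a batch whose embeddings are all identical is penalised by the full margin for every image and every caption. Taking the maximum rather than the sum over negatives is what concentrates the training signal on the hardest examples.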

Bibliographic record

  • Source
    Neurocomputing | 2020, Issue 6 | pp. 31-40 | 10 pages
  • Authors

    Cai Wenjie; Liu Qiong;

  • Affiliations

    South China Univ Technol Sch Software Engn Guangzhou Peoples R China;

    South China Univ Technol Sch Software Engn Guangzhou Peoples R China;

  • Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA);
  • Original format: PDF
  • Language: English
  • Keywords

    Image captioning; Image-text matching; Negative examples;
