Conference: IEEE/CVF Conference on Computer Vision and Pattern Recognition

Context-Aware Group Captioning via Self-Attention and Contrastive Features


Abstract

Image captioning has progressed rapidly, but existing work focuses mainly on describing single images. In this paper, we introduce a new task, context-aware group captioning, which aims to describe a group of target images in the context of another group of related reference images. The task requires not only summarizing information from both the target and reference image groups but also contrasting the two. To solve this problem, we propose a framework that combines a self-attention mechanism with contrastive feature construction to summarize the common information in each image group while capturing the discriminative information between them. To build the dataset for this task, we group images and generate group captions from single-image captions using scene graph matching. Our datasets are built on top of the public Conceptual Captions dataset and our new Stock Captions dataset. Experiments on the two datasets show the effectiveness of our method on this new task.
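The abstract does not spell out how the self-attention summary and the contrastive features are computed, so the following is only a minimal sketch of one plausible reading. It assumes that each image is already encoded as a feature vector, that self-attention uses identity projections (no learned weights), that a group is summarized by mean pooling, and that the contrastive feature is the group summary minus a joint summary of all images. The function names (`self_attention`, `mean_pool`, `group_representation`) and the toy feature values are hypothetical and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def self_attention(feats):
    """Scaled dot-product self-attention with identity projections.

    Assumption: a trained model would use learned query/key/value weights;
    they are omitted here to keep the sketch dependency-free.
    """
    dim = len(feats[0])
    attended = []
    for query in feats:
        weights = softmax([dot(query, key) / math.sqrt(dim) for key in feats])
        attended.append(
            [sum(w * value[j] for w, value in zip(weights, feats)) for j in range(dim)]
        )
    return attended


def mean_pool(feats):
    """Average a list of vectors into a single vector."""
    return [sum(f[j] for f in feats) / len(feats) for j in range(len(feats[0]))]


def group_representation(target_feats, reference_feats):
    """Summarize each group, then contrast it against the joint summary."""
    target = mean_pool(self_attention(target_feats))
    reference = mean_pool(self_attention(reference_feats))
    joint = mean_pool(self_attention(target_feats + reference_feats))
    return {
        "target": target,
        "reference": reference,
        # Assumed contrastive form: what a group carries beyond the joint summary.
        "target_contrast": [t - c for t, c in zip(target, joint)],
        "reference_contrast": [r - c for r, c in zip(reference, joint)],
    }


# Toy features: the third dimension is strong in the target group only.
target_group = [[1.0, 0.0, 1.0], [1.0, 0.0, 0.8]]
reference_group = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0]]
rep = group_representation(target_group, reference_group)
```

In this toy setup the third component of `rep["target_contrast"]` is positive and the same component of `rep["reference_contrast"]` is negative, which is the kind of discriminative signal a caption decoder could condition on alongside the group summaries.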
