Conference: European Conference on Computer Vision

Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning

Abstract

Change captioning is the task of describing the differences between images in natural language. Most existing methods treat this problem as a difference judgment in the absence of distractors such as viewpoint changes. In practice, however, viewpoint changes occur often and can overwhelm the semantic difference to be described. In this paper, we propose a novel visual encoder that explicitly distinguishes viewpoint changes from semantic changes in the change captioning task. We further simulate the attention preference of humans and propose a novel reinforcement learning process that fine-tunes the attention directly with language evaluation rewards. Extensive experimental results show that our method outperforms state-of-the-art approaches by a large margin on both the Spot-the-Diff and CLEVR-Change datasets.
