Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning

Abstract

Change Captioning is a task that aims to describe the difference between images with natural language. Most existing methods treat this problem as a difference judgment without the existence of distractors, such as viewpoint changes. However, in practice, viewpoint changes happen often and can overwhelm the semantic difference to be described. In this paper, we propose a novel visual encoder to explicitly distinguish viewpoint changes from semantic changes in the change captioning task. Moreover, we further simulate the attention preference of humans and propose a novel reinforcement learning process to fine-tune the attention directly with language evaluation rewards. Extensive experimental results show that our method outperforms the state-of-the-art approaches by a large margin in both Spot-the-Diff and CLEVR-Change datasets.

Bibliographic record

Source
European Conference on Computer Vision, 2020, pp. 574-590 (17 pages)
Authors
Xiangxi Shi; Xu Yang; Jiuxiang Gu; Shafiq Joty; Jianfei Cai;
Original format: PDF
Keywords
Image captioning; Change captioning; Attention; Reinforcement learning;

