Comprehension-Guided Referring Expressions


Abstract

We consider the generation and comprehension of natural language referring expressions for objects in an image. Unlike generic image captioning, which lacks natural standard evaluation criteria, the quality of a referring expression may be measured by the receiver's ability to correctly infer which object is being described. Following this intuition, we propose two approaches that use models trained for the comprehension task to generate better expressions. First, we use a comprehension module trained on human-generated expressions as a critic of the referring expression generator. The comprehension module serves as a differentiable proxy for human evaluation, providing a training signal to the generation module. Second, we use the comprehension model in a generate-and-rerank pipeline, which chooses from candidate expressions generated by a model according to their performance on the comprehension task. We show that both approaches lead to improved referring expression generation on multiple benchmark datasets.
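
The generate-and-rerank idea described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the scorer `toy_comprehension_score`, the representation of a region as a set of attribute words, and all function names are hypothetical and were chosen only for this example. In the paper, the comprehension model is trained on human-generated expressions.

```python
import math
from typing import Callable, List, Sequence, Set

# Assumption: a candidate region is represented by a set of attribute words.
Region = Set[str]
ScoreFn = Callable[[str, Region], float]


def toy_comprehension_score(expression: str, region: Region) -> float:
    """Hypothetical scorer: count expression words that match the region."""
    return float(sum(1 for word in expression.lower().split() if word in region))


def target_probability(expression: str, regions: Sequence[Region],
                       target_idx: int, score_fn: ScoreFn) -> float:
    """Softmax over region scores; returns the probability of the target."""
    scores = [score_fn(expression, region) for region in regions]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    return exps[target_idx] / sum(exps)


def rerank(candidates: Sequence[str], regions: Sequence[Region],
           target_idx: int,
           score_fn: ScoreFn = toy_comprehension_score) -> List[str]:
    """Order candidates by how well the comprehension scorer recovers the target."""
    return sorted(
        candidates,
        key=lambda e: target_probability(e, regions, target_idx, score_fn),
        reverse=True,
    )


# Two regions that differ only in colour and position; region 0 is the target.
regions = [{"man", "red", "shirt", "left"}, {"man", "blue", "shirt", "right"}]
candidates = ["man", "man in shirt", "man in red shirt"]
best = rerank(candidates, regions, target_idx=0)[0]
print(best)  # man in red shirt
```

Here the ambiguous candidates "man" and "man in shirt" match both regions equally well, so the scorer gives the target a probability of 0.5. Only the expression that mentions "red" lets the receiver pick out the intended object, and it is ranked first.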


Bibliographic record

Source
IEEE Conference on Computer Vision and Pattern Recognition | 2017 | pp. 3125-3134 | 10 pages
Authors
Ruotian Luo; Gregory Shakhnarovich
Original format
PDF
Keywords
Generators; Training; Visualization; Birds; Gallium nitride; Context modeling

