【24h】

Comprehension-Guided Referring Expressions

机译:理解指导的指称表达

获取原文

摘要

We consider generation and comprehension of natural language referring expression for objects in an image. Unlike generic image captioning which lacks natural standard evaluation criteria, quality of a referring expression may be measured by the receivers ability to correctly infer which object is being described. Following this intuition, we propose two approaches to utilize models trained for comprehension task to generate better expressions. First, we use a comprehension module trained on human-generated expressions, as a critic of referring expression generator. The comprehension module serves as a differentiable proxy of human evaluation, providing training signal to the generation module. Second, we use the comprehension model in a generate-and-rerank pipeline, which chooses from candidate expressions generated by a model according to their performance on the comprehension task. We show that both approaches lead to improved referring expression generation on multiple benchmark datasets.
机译:我们考虑图像中对象的自然语言引用表达的生成和理解。与缺少自然标准评估标准的通用图像字幕不同,可以通过接收者正确推断正在描述的对象的能力来衡量引用表达的质量。根据这种直觉,我们提出两种方法来利用为理解任务训练的模型来生成更好的表达式。首先,我们使用对人为生成的表达式进行训练的理解模块,作为引用表达式生成器的批评者。理解模块充当人类评估的可区分代理,为生成模块提供训练信号。其次,我们在生成和重排管道中使用理解模型,该模型根据模型在理解任务上的性能从模型生成的候选表达式中进行选择。我们表明,这两种方法都可以改善在多个基准数据集上的引用表达的生成。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号