【24h】

What You See is What You Get: Visual Pronoun Coreference Resolution in Dialogues

机译:所见即所得:对话中的视觉代词同指解析

获取原文

摘要

Grounding a pronoun to a visual object it refers to requires complex reasoning from various information sources, especially in conversational scenarios. For example, when people in a conversation talk about something all speakers can see, they often directly use pronouns (e.g., it) to refer to it without previous introduction. This fact brings a huge challenge for modern natural language understanding systems, particularly conventional context-based pronoun coreference models. To tackle this challenge, in this paper, we formally define the task of visual-aware pronoun coreference resolution (PCR) and introduce VisPro, a large-scale dialogue PCR dataset, to investigate whether and how the visual information can help resolve pronouns in dialogues. We then propose a novel visual-aware PCR model. VisCoref, for this task and conduct comprehensive experiments and case studies on our dataset. Results demonstrate the importance of the visual information in this PCR case and show the effectiveness of the proposed model.
机译:代词以其所指的视觉对象为基础,需要来自各种信息源的复杂推理,尤其是在对话场景中。例如,当人们在谈话中谈论所有说话者都能看到的东西时,他们经常直接使用代词(例如,它)来指代它,而无需先前的介绍。这一事实给现代自然语言理解系统,尤其是传统的基于上下文的代词共指模型带来了巨大挑战。为了应对这一挑战,在本文中,我们正式定义了视觉感知代词共指解析(PCR)的任务,并引入了大规模对话PCR数据集VisPro,以研究视觉信息是否以及如何帮助解决对话中的代词。然后,我们提出了一种新颖的视觉感知PCR模型。 VisCoref,为此任务,并在我们的数据集上进行了全面的实验和案例研究。结果证明了在这种PCR情况下视觉信息的重要性,并表明了所提出模型的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号