What You See is What You Get: Visual Pronoun Coreference Resolution in Dialogues

机译：所见即所得：对话中的视觉代词同指解析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Grounding a pronoun to a visual object it refers to requires complex reasoning from various information sources, especially in conversational scenarios. For example, when people in a conversation talk about something all speakers can see, they often directly use pronouns (e.g., it) to refer to it without previous introduction. This fact brings a huge challenge for modern natural language understanding systems, particularly conventional context-based pronoun coreference models. To tackle this challenge, in this paper, we formally define the task of visual-aware pronoun coreference resolution (PCR) and introduce VisPro, a large-scale dialogue PCR dataset, to investigate whether and how the visual information can help resolve pronouns in dialogues. We then propose a novel visual-aware PCR model. VisCoref, for this task and conduct comprehensive experiments and case studies on our dataset. Results demonstrate the importance of the visual information in this PCR case and show the effectiveness of the proposed model.

机译：代词以其所指的视觉对象为基础，需要来自各种信息源的复杂推理，尤其是在对话场景中。例如，当人们在谈话中谈论所有说话者都能看到的东西时，他们经常直接使用代词（例如，它）来指代它，而无需先前的介绍。这一事实给现代自然语言理解系统，尤其是传统的基于上下文的代词共指模型带来了巨大挑战。为了应对这一挑战，在本文中，我们正式定义了视觉感知代词共指解析（PCR）的任务，并引入了大规模对话PCR数据集VisPro，以研究视觉信息是否以及如何帮助解决对话中的代词。然后，我们提出了一种新颖的视觉感知PCR模型。 VisCoref，为此任务，并在我们的数据集上进行了全面的实验和案例研究。结果证明了在这种PCR情况下视觉信息的重要性，并表明了所提出模型的有效性。

著录项

来源
《International joint conference on natural language processing;Conference on empirical methods in natural language processing 》|2019年|5122-5131|共10页
会议地点
作者
Xintong Yu; Hongming Zhang; Yangqiu Song; Yan Song; Changshui Zhang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Ellipsis and Coreference Resolution in a Computerized Virtual Patient Dialogue System [J] . Lin Chuan-Jie, Pao Chien-Wei, Chen Yen-Heng, Journal of medical systems . 2016 ,第9期

机译：计算机化虚拟患者对话系统中的省略和共指解析
2. The Influence of Focus Marking on Pronoun Resolution in Dialogue Context [J] . Liam P. Blything, Juhani J?rvikivi, Abigail G. Toth, Frontiers in Psychology . 2021 ,第a期

机译：对话背景下代词分辨率对焦标记的影响
3. Structural constraints on pronoun binding and coreference: evidence from eye movements during reading [J] . Ian Cunnings, Clare Patterson, Claudia Felser Frontiers in Psychology . 2015 ,第4期

机译：代词绑定和共指的结构性限制：阅读过程中眼睛运动的证据
4. What You See is What You Get: Visual Pronoun Coreference Resolution in Dialogues [C] . Xintong Yu, Hongming Zhang, Yangqiu Song, International joint conference on natural language processing . 2019

机译：您所看到的是您获得的内容：视觉代词在对话中的Coreference分辨率
5. Coreference Resolution for Downstream NLP Tasks [D] . Pani, Sushanta Kumar. 2021

机译：下游NLP任务的Coreference分辨率
6. The Influence of Focus Marking on Pronoun Resolution in Dialogue Context [O] . Liam P. Blything, Juhani Järvikivi, Abigail G. Toth, 2021

机译：对话背景下代焦标记对代词决议的影响
7. Coreference resolution: maximum metric score training, domain adaptation, and zero pronoun resolution [O] . ZHAO SHANHENG 2011

机译：共指分辨率：最大度量标准分数训练，域自适应和零代词分辨率
8. Dialogue Structure and Pronoun Resolution [R] . Tetreault, J. R. , Allen, J. F. 2006

机译：对话结构与代词解析

What You See is What You Get: Visual Pronoun Coreference Resolution in Dialogues

摘要

著录项

相似文献

相关主题

期刊订阅