首页> 外文学位 >Acquisition and application of contextual role knowledge for coreference resolution.
【24h】

Acquisition and application of contextual role knowledge for coreference resolution.

机译:获取和应用上下文角色知识以实现共指解析。

获取原文
获取原文并翻译 | 示例

摘要

Coreference resolution is the process of identifying when two noun phrases (NP) refer to the same entity. This dissertation makes two main contributions to computational coreference resolution.; First, this work contributes a new method for recognizing when an NP is anaphoric. Most pronouns have an antecedent, but many definite noun phrases do not. I present an unsupervised model for learning nonanaphoric definite NPs from a text collection, and I show that it learns lists of these noun phrases with good accuracy. Recall of these NPs increases from 43% to 79%. I also demonstrate that using these lists to filter nonanaphoric definite NPs prior to coreference resolution provides a mechanism for effecting a recall/precision tradeoff. In two distinct testing domains, recall is traded for precision, leading to precision increases from 60% to 73% and from 68% to 82%.; Second, traditional approaches to coreference resolution typically select the most appropriate antecedent by recognizing word similarity, proximity, and agreement in number, gender, and semantic class. This work contributes a new source of evidence that focuses on the roles that an anaphor and antecedent play in particular events or relationships. I show that using contextual role knowledge as part of the coreference resolution process increases the number of anaphors that can be resolved, and I demonstrate an unsupervised method for acquiring contextual role knowledge that does not require an annotated training corpus. A probabilistic model based on the Dempster-Shafer model of evidence is used to incorporate contextual role knowledge with traditional evidence sources. Among the advantages of this model is the capability to assign evidence to a set of candidates when a knowledge source is unable to distinguish among them. In the two testing domains, the F-measure of anaphor/antecedent pairs increases from 0.57 to 0.61 and from 0.57 to 0.63. Recall increases from 46% to 53% and from 42% to 51% with only minor reductions in precision.
机译:共指解析是识别两个名词短语(NP)何时指代同一实体的过程。本文对计算共参考分辨率做出了两个主要贡献。首先,这项工作为识别NP隐喻时提供了一种新的方法。大多数代词都有一个先行词,但许多定冠词短语却没有。我提出了一种用于从文本集合中学习非指称确定NP的无监督模型,并且表明该模型可以很好地学习这些名词短语的列表。这些NP的召回率从43%增加到79%。我还演示了使用这些列表在共参考解析之前过滤非指代确定性NP提供了一种实现召回/精确度折衷的机制。在两个截然不同的测试领域中,召回率是以精度为代价的,导致精度从60%提高到73%,从68%提高到82%。其次,传统的共指解析方法通常是通过识别单词的相似性,相近性和数量,性别和语义类别上的一致来选择最合适的先行词。这项工作提供了一个新的证据来源,重点放在了回指和先行词在特定事件或关系中所扮演的角色。我展示了将上下文角色知识用作共指解析过程的一部分会增加可以解决的照应的数量,并且我演示了一种无需监督的获取上下文角色知识的方法,该方法不需要带注释的训练语料库。基于证据的Dempster-Shafer模型的概率模型用于将上下文角色知识与传统证据源相结合。该模型的优点之一是能够在知识来源无法区分候选人时为一组候选人分配证据。在这两个测试域中,回指/先行对的F度量从0.57增加到0.61,从0.57增加到0.63。召回率从46%增至53%,从42%增至51%,而精度仅稍有下降。

著录项

  • 作者

    Bean, David L.;

  • 作者单位

    The University of Utah.;

  • 授予单位 The University of Utah.;
  • 学科 Computer Science.; Language Linguistics.
  • 学位 Ph.D.
  • 年度 2004
  • 页码 186 p.
  • 总页数 186
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;语言学;
  • 关键词

  • 入库时间 2022-08-17 11:43:18

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号