
Scoring Coreference Partitions of Predicted Mentions: A Reference Implementation



Abstract

The definitions of two coreference scoring metrics, B³ and CEAF, are underspecified with respect to predicted, as opposed to key (or gold), mentions. Several variations have been proposed that manipulate the key mentions, the predicted mentions, or both, in order to obtain a one-to-one mapping. The metric BLANC, on the other hand, was until recently limited to scoring partitions of key mentions. In this paper, we (i) argue that mention manipulation for scoring predicted mentions is unnecessary, and potentially harmful because it can produce unintuitive results; (ii) illustrate the application of all these measures to scoring predicted mentions; (iii) make available an open-source, thoroughly tested reference implementation of the main coreference evaluation measures; and (iv) rescore the results of the CoNLL-2011/2012 shared task systems with this implementation. This will help the community accurately measure and compare new end-to-end coreference resolution algorithms.
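To make concrete what scoring predicted mentions without mention manipulation can look like, here is a minimal Python sketch of B³. It assumes the common mention-based formulation in which each key entity K_i earns credit |K_i ∩ R_j|² / |K_i| for every response entity R_j, so a mention that appears in only one of the two partitions simply contributes no overlap. The function name `b_cubed` and the example partitions are illustrative assumptions, not part of the reference implementation the abstract describes.

```python
from fractions import Fraction


def b_cubed(key, response):
    """Return (precision, recall, F1) for two coreference partitions.

    `key` and `response` are iterables of entities; each entity is an
    iterable of hashable mention identifiers. Assumption: within one
    partition, a mention belongs to at most one entity. Mentions are
    never added to or removed from either side.
    """
    key = [set(e) for e in key if e]
    response = [set(e) for e in response if e]

    def directed(source, target):
        # Total number of mentions on the side being scored.
        total = sum(len(s) for s in source)
        if total == 0:
            return 0.0
        # Each source entity earns |s & t|^2 / |s| for every target entity.
        credit = sum(
            Fraction(len(s & t) ** 2, len(s)) for s in source for t in target
        )
        return float(credit / total)

    recall = directed(key, response)
    precision = directed(response, key)
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Illustrative partitions: 'e' is a key mention the system missed,
# 'h' and 'i' are predicted mentions that are not in the key.
key_entities = [{"a", "b", "c"}, {"d", "e", "f", "g"}]
response_entities = [{"a", "b"}, {"c", "d"}, {"f", "g", "h", "i"}]
precision, recall, f1 = b_cubed(key_entities, response_entities)
print(f"P={precision:.2f} R={recall:.2f} F1={f1:.2f}")
```

In this sketch the spurious mentions lower precision only through the size of the response entities that contain them, and the missed mention lowers recall only through the size of its key entity. Neither partition has to be padded or pruned to force a one-to-one mention mapping.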
