首页> 外文会议>International conference on theory and practice of digital libraries >Harnessing Historical Corrections to Build Test Collections for Named Entity Disambiguation
【24h】

Harnessing Historical Corrections to Build Test Collections for Named Entity Disambiguation

机译:利用历史更正为命名实体消除歧义建立测试集合

获取原文

摘要

Matching mentions of persons to the actual persons (the name disambiguation problem) is central for many digital library applications. Scientists have been working on algorithms to create this matching for decades without finding a universal solution. One problem is that test collections for this problem are often small and specific to a certain collection. In this work, we present an approach that can create large test collections from historical metadata with minimal extra cost. We apply this approach to the dblp collection to generate two freely available test collections. One collection focuses on the properties of name-related defects (such as similarities of synonymous names) and one on the evaluation of disambiguation algorithms.
机译:将提及的人员与实际的人员相匹配(名称消除歧义问题)是许多数字图书馆应用程序的核心。数十年来,科学家一直在研究算法来创建这种匹配,而没有找到通用的解决方案。一个问题是,针对该问题的测试集合通常很小,并且特定于某个集合。在这项工作中,我们提出了一种方法,可以以最低的额外成本从历史元数据创建大型测试集合。我们将此方法应用于dblp集合,以生成两个免费可用的测试集合。一个集合关注于与名称相关的缺陷的属性(例如,同义名称的相似性),另一个关注于对歧义消除算法的评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号