Harnessing Historical Corrections to Build Test Collections for Named Entity Disambiguation

机译：利用历史更正为命名实体消除歧义建立测试集合

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Matching mentions of persons to the actual persons (the name disambiguation problem) is central for many digital library applications. Scientists have been working on algorithms to create this matching for decades without finding a universal solution. One problem is that test collections for this problem are often small and specific to a certain collection. In this work, we present an approach that can create large test collections from historical metadata with minimal extra cost. We apply this approach to the dblp collection to generate two freely available test collections. One collection focuses on the properties of name-related defects (such as similarities of synonymous names) and one on the evaluation of disambiguation algorithms.

机译：将提及的人员与实际的人员相匹配（名称消除歧义问题）是许多数字图书馆应用程序的核心。数十年来，科学家一直在研究算法来创建这种匹配，而没有找到通用的解决方案。一个问题是，针对该问题的测试集合通常很小，并且特定于某个集合。在这项工作中，我们提出了一种方法，可以以最低的额外成本从历史元数据创建大型测试集合。我们将此方法应用于dblp集合，以生成两个免费可用的测试集合。一个集合关注于与名称相关的缺陷的属性（例如，同义名称的相似性），另一个关注于对歧义消除算法的评估。

著录项

来源
《International conference on theory and practice of digital libraries》|2018年|47-58|共12页
会议地点
作者
Florian Reitz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Name disambiguation; Historical metadata; dblp;

机译：名称歧义;历史元数据; dblp;
入库时间 2022-08-26 13:51:36

相似文献

外文文献
中文文献
专利

1. Using lexical disambiguation and named-entity recognition to improve spelling correction in the electronic patient record [J] . Patrick Ruch, Robert Baud, Antoine Geissbuehler Artificial intelligence in medicine . 2003,第1a2期

机译：使用词汇歧义消除和命名实体识别来改善电子病历中的拼写校正
2. Disambiguating the Twitter Stream Entities and Enhancing the Search Operation Using DBpedia Ontology: Named Entity Disambiguation for Twitter Streams [J] . N. Senthil Kumar, Dinakaran Muruganantham International journal of information technology and web engineering . 2016,第2期

机译：使用DBpedia本体消除Twitter流实体的歧义并增强搜索操作：Twitter流的命名实体歧义
3. Exploring entity recognition and disambiguation for cultural heritage collections [J] . van Hooland Seth, De Wilde Max, Verborgh Ruben, Literary & linguistic computing . 2015,第2期

机译：探索实体对文化遗产收藏的认可和消除歧义
4. Harnessing Historical Corrections to Build Test Collections for Named Entity Disambiguation [C] . Florian Reitz International Conference on Theory and Practice of Digital Libraries . 2018

机译：利用历史修正以构建名为实体消歧的测试收藏
5. Robust unsupervised named-entity disambiguation [D] . Chen, Ying 2008

机译：鲁棒的无监督命名实体歧义消除
6. Ambiguity of Human Gene Symbols in LocusLink and MEDLINE: Creating an Inventory and a Disambiguation Test Collection [O] . Marc Weeber, Bob J. A. Schijvenaars, Erik M. van Mulligen, 2003

机译：LocusLink和MEDLINE中人类基因符号的歧义：创建清单和歧义测试集
7. Domain-specific named entity disambiguation in historical memoirs [O] . Rovera Marco, Nanni Federico, Ponzetto Simone Paolo, 2017

机译：历史回忆录中特定于域的命名实体歧义消除

Harnessing Historical Corrections to Build Test Collections for Named Entity Disambiguation

摘要

著录项

相似文献

相关主题

期刊订阅