首页> 外文会议>9th International conference on language resources and evaluation >Improving Entity Linking using Surface Form Refinement
【24h】

Improving Entity Linking using Surface Form Refinement

机译:使用表面形状细化改善实体链接

获取原文

摘要

In this paper, we present an algorithm for improving named entity resolution and entity linking by using surface form generation and rewriting. Surface forms consist of a word or a group of words that matches lexical units like Paris or New York City. Used as matching sequences to select candidate entries in a knowledge base, they contribute to the disambiguation of those candidates through similarity measures. In this context, misspelled textual sequences (entities) can be impossible to identify due to the lack of available matching surface forms. To address this problem, we propose an algorithm for surface form refinement based on Wikipedia resources. The approach extends the surface form coverage of our entity linking system, and rewrites or reformulates misspelled mentions (entities) prior to starting the annotation process. The algorithm is evaluated on the corpus associated with the monolingual English entity linking task of N1ST KBP 2013. We show that the algorithm improves the entity linking system performance.
机译:在本文中,我们提出了一种通过使用表面形式生成和重写来提高命名实体分辨率和实体链接的算法。表面形式由一个单词或一组单词组成,这些单词或单词组与词汇单位(如巴黎或纽约)相匹配。它们用作匹配序列以选择知识库中的候选条目,它们通过相似性度量有助于消除这些候选者的歧义。在这种情况下,由于缺少可用的匹配表面形式,因此无法识别拼写错误的文本序列(实体)。为了解决这个问题,我们提出了一种基于维基百科资源的表面形状细化算法。该方法扩展了我们实体链接系统的表面形式覆盖范围,并在开始注释过程之前重写或重新拼写了错误拼写的提及(实体)。该算法在与N1ST KBP 2013的单语英语实体链接任务相关的语料库上进行了评估。我们证明了该算法提高了实体链接系统的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号