首页> 外文会议>International semantic web conference >AGDISTIS - Graph-Based Disambiguation of Named Entities Using Linked Data
【24h】

AGDISTIS - Graph-Based Disambiguation of Named Entities Using Linked Data

机译:AGDISTIS-使用链接数据对命名实体进行基于图的消歧

获取原文

摘要

Over the last decades, several billion Web pages have been made available on the Web. The ongoing transition from the current Web of unstructured data to the Web of Data yet requires scalable and accurate approaches for the extraction of structured data in RDF (Resource Description Framework) from these websites. One of the key steps towards extracting RDF from text is the disambiguation of named entities. While several approaches aim to tackle this problem, they still achieve poor accuracy. We address this drawback by presenting AGDISTIS, a novel knowledge-base-agnostic approach for named entity disambiguation. Our approach combines the Hypertext-Induced Topic Search (HITS) algorithm with label expansion strategies and string similarity measures. Based on this combination, AGDISTIS can efficiently detect the correct URIs for a given set of named entities within an input text. We evaluate our approach on eight different datasets against state-of-the-art named entity disambiguation frameworks. Our results indicate that we outperform the state-of-the-art approach by up to 29% F-measure.
机译:在过去的几十年中,Web上已经有数十亿个Web页面可用。从当前的非结构化数据Web到数据Web的持续过渡,仍然需要可伸缩且准确的方法来从这些网站中提取RDF(资源描述框架)中的结构化数据。从文本中提取RDF的关键步骤之一是消除命名实体的歧义。尽管有几种方法旨在解决此问题,但它们仍然达到较差的精度。我们通过提出AGDISTIS(一种解决命名实体歧义的新颖的与知识库无关的方法)来解决此缺点。我们的方法将超文本诱导主题搜索(HITS)算法与标签扩展策略和字符串相似性度量相结合。基于此组合,AGDISTIS可以有效地为输入文本内的给定命名实体集检测正确的URI。我们针对最新的命名实体消歧框架在八个不同的数据集上评估了我们的方法。我们的结果表明,我们通过最先进的方法获得了高达29%的F值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号