【24h】

Linking Named Entities to Any Database

机译:将命名实体链接到任何数据库

获取原文

摘要

Existing techniques for disambiguating named entities in text mostly focus on Wikipedia as a target catalog of entities. Yet for many types of entities, such as restaurants and cult movies, relational databases exist that contain far more extensive information than Wikipedia. This paper introduces a new task, called Open-Database Named-Entity Disambiguation (Open-DB NED), in which a system must be able to resolve named entities to symbols in an arbitrary database, without requiring labeled data for each new database. We introduce two techniques for Open-DB NED, one based on distant supervision and the other based on domain adaptation. In experiments on two domains, one with poor coverage by Wikipedia and the other with near-perfect coverage, our Open-DB NED strategies outperform a state-of-the-art Wikipedia NED system by over 25% in accuracy.
机译:现有的用于消除文本中命名实体歧义的技术主要集中在Wikipedia上,作为实体的目标目录。但是对于许多类型的实体,例如饭店和电影院,存在关系数据库,其中包含的信息远比Wikipedia广泛。本文介绍了一项称为开放数据库命名实体歧义消除(Open-DB NED)的新任务,其中系统必须能够将命名实体解析为任意数据库中的符号,而无需为每个新数据库使用带标签的数据。我们介绍了两种用于Open-DB NED的技术,一种基于远程监管,另一种基于域自适应。在两个领域的实验中,一个领域的维基百科覆盖率很差,另一个领域的覆盖率则接近完美,我们的Open-DB NED策略的准确性比最新的Wikipedia NED系统高25%以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号