首页> 外文会议>IEEE International Conference on Data Mining Workshops >Named Entity Disambiguation Leveraging Multi-aspect Information
【24h】

Named Entity Disambiguation Leveraging Multi-aspect Information

机译:利用多方面信息的命名实体消歧

获取原文

摘要

Named Entity Disambiguation (NED) aims at disambiguating named entity mentions in a text to their corresponding entries in a knowledge base such as Wikipedia. Itis a fundamental task in Natural Language Processing (NLP)and has many applications such as information extraction, information retrieval, and knowledge acquisition. In the past decade, a number of methods have been proposed for the NED task. However, most of existing work focuses on exploring many more useful information to help tackle this problem. The effectiveness of different features proposed for the task are not well-studied in a same platform. In this paper, we extract various remarkable features by leveraging statistical, textual and semantic information, and evaluate various combinations of the multiaspect features for the disambiguation task in the same platform. Specifically, we utilize two learning to rank methods to combine different features, train and test the combined methods on several standard data sets. Through extensive experiments, we investigate the effects on the quality of the disambiguation of exploiting different features and show which combinations of features are the best choices for disambiguation.
机译:命名实体消除歧义(NED)的目的是将文本中的命名实体提及与诸如Wikipedia之类的知识库中的相应条目消除歧义。它是自然语言处理(NLP)的一项基本任务,具有许多应用程序,例如信息提取,信息检索和知识获取。在过去的十年中,已经提出了许多用于NED任务的方法。但是,大多数现有工作都集中于探索更多有用的信息来帮助解决此问题。在同一平台上没有充分研究针对该任务提出的不同功能的有效性。在本文中,我们通过利用统计,文本和语义信息提取各种显着特征,并在同一平台上评估多方面特征的各种组合以实现消歧任务。具体来说,我们利用两种学习对方法进行排序,以组合不同的功能,并在几个标准数据集上训练和测试组合的方法。通过广泛的实验,我们研究了利用不同特征对消歧质量的影响,并显示了哪些特征组合是消歧的最佳选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号