首页> 外文会议>International Semantic Web Conference >Context and Domain Knowledge Enhanced Entity Spotting in Informal Text
【24h】

Context and Domain Knowledge Enhanced Entity Spotting in Informal Text

机译:背景和域知识增强了非正式文本中的实体

获取原文
获取外文期刊封面目录资料

摘要

This paper explores the application of restricted relationship graphs (RDF) and statistical NLP techniques to improve named entity annotation in challenging Informal English domains. We validate our approach using on-line forums discussing popular music. Named entity annotation is particularly difficult in this domain because it is characterized by a large number of ambiguous entities, such as the Madonna album "Music" or Lilly Allen's pop hit "Smile". We evaluate improvements in annotation accuracy that can be obtained by restricting the set of possible entities using real-world constraints. We find that constrained domain entity extraction raises the annotation accuracy significantly, making an infeasible task practical. We then show that we can further improve annotation accuracy by over 50% by applying SVM based NLP systems trained on word-usages in this domain.
机译:本文探讨了受限制关系图(RDF)和统计NLP技术的应用,以改善挑战非正式英语域中的命名实体注释。我们使用在线论坛验证我们的方法讨论流行音乐。这个领域的命名实体注释特别困难,因为它的特点是大量含糊不清的实体,例如麦当娜专辑“音乐”或莉莉艾伦的流行击中“微笑”。我们评估了通过使用真实约束来限制可能的可能实体来获得的注释精度的改进。我们发现受约束的域实体提取显着提高了注释精度,从而产生了不可行的任务。然后,我们通过应用基于SVM的NLP系统在该域中的Word-Usages上培训,我们可以进一步提高注释精度超过50%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号