首页> 外文会议>Annual meeting of the Association for Computational Linguistics;ACL 2012 >Joint Inference of Named Entity Recognition and Normalization for Tweets
【24h】

Joint Inference of Named Entity Recognition and Normalization for Tweets

机译:推文的命名实体识别和规范化联合推断

获取原文

摘要

Tweets represent a critical source of fresh information, in which named entities occur frequently with rich variations. We study the problem of named entity normalization (NEN) for tweets. Two main challenges are the errors propagated from named entity recognition (NER) and the dearth of information in a single tweet. We propose a novel graphical model to simultaneously conduct NER and NEN on multiple tweets to address these challenges. Particularly, our model introduces a binary random variable for each pair of words with the same lemma across similar tweets, whose value indicates whether the two related words are mentions of the same entity. We evaluate our method on a manually annotated data set, and show that our method outperforms the baseline that handles these two tasks separately, boosting the Fl from 80.2% to 83.6% for NER, and the Accuracy from 79.4% to 82.6% for NEN, respectively.
机译:推文代表了新信息的重要来源,在其中,命名实体频繁出现且变化丰富。我们研究了推文的命名实体规范化(NEN)问题。两个主要挑战是从命名实体识别(NER)传播的错误和单个推文中信息的缺乏。我们提出了一种新颖的图形模型,可同时在多个推文上进行NER和NEN的处理,以应对这些挑战。特别是,我们的模型针对相似推文中具有相同引理的每对单词引入了一个二进制随机变量,其值指示两个相关单词是否提及同一实体。我们在手动注释的数据集上评估了我们的方法,结果表明我们的方法优于分别处理这两项任务的基准,将NER的Fl从80.2%提高到83.6%,NEN的准确性从79.4%提高到82.6%,分别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号