首页> 外文会议>International conference on recent advances in natural language processing >A Domain and Language Independent Named Entity Classification Approach Based on Profiles and Local Information
【24h】

A Domain and Language Independent Named Entity Classification Approach Based on Profiles and Local Information

机译:基于配置文件和本地信息的领域和语言无关的命名实体分类方法

获取原文

摘要

This paper presents a Named Entity Classification system, which employs machine learning. Our methodology employs local entity information and profiles as feature set. All features are generated in an unsupervised manner. It is tested on two different data sets: (i) DrugSemantics Spanish corpus (Overall F1 = 74.92), whose results are in-line with the state of the art without employing external domain-specific resources. And, (ii) English CoNLL2003 dataset (Overall F1 = 81.40), although our results are slightly lower than previous work, these are reached without external knowledge or complex linguistic analysis. Last, using the same configuration for the two corpora, the difference of overall F1 is only 6.48 points (DrugSemantics = 74.92 versus CoNLL2003 = 81.40). Thus, this result supports our hypothesis that our approach is language and domain independent and does not require any external knowledge or complex linguistic analysis.
机译:本文提出了一种采用机器学习的命名实体分类系统。我们的方法采用本地实体信息和配置文件作为功能集。所有功能均以无人监督的方式生成。它在两个不同的数据集上进行了测试:(i)DrugSemantics西班牙语语料库(总体F1 = 74.92),其结果与最新技术一致,没有使用外部特定领域的资源。并且,(ii)英语CoNLL2003数据集(总体F1 = 81.40),尽管我们的结果比以前的研究要低一些,但是这些结果是在没有外部知识或复杂语言分析的情况下得出的。最后,对于两个语料库使用相同的配置,总体F1的差异仅为6.48点(DrugSemantics = 74.92 vs CoNLL2003 = 81.40)。因此,这一结果支持了我们的假设,即我们的方法是语言和领域独立的,并且不需要任何外部知识或复杂的语言分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号