首页> 外文会议>9th International conference on language resources and evaluation >Comparative Analysis of Portuguese Named Entities Recognition Tools
【24h】

Comparative Analysis of Portuguese Named Entities Recognition Tools

机译:葡萄牙语命名实体识别工具的比较分析

获取原文

摘要

This paper describes an experiment to compare four tools to recognize named entities in Portuguese texts. The experiment was made over the HAREM corpora, a golden standard for named entities recognition in Portuguese. The tools experimented are based on natural language processing techniques and also machine learning. Specifically, one of the tools is based on Conditional random fields, an unsupervised machine learning model that has being used to named entities recognition in several languages, while the other tools follow more traditional natural language approaches. The comparison results indicate advantages for different tools according to the different classes of named entities. Despite of such balance among tools, we conclude pointing out foreseeable advantages to the machine learning based tool.
机译:本文介绍了一种比较四个工具以识别葡萄牙语文本中命名实体的实验。实验是在HAREM语料库上进行的,这是葡萄牙语中命名实体识别的黄金标准。实验的工具基于自然语言处理技术以及机器学习。具体来说,其中一种工具基于条件随机字段,这是一种无监督的机器学习模型,已用于多种语言中的命名实体识别,而其他工具则遵循更传统的自然语言方法。比较结果表明,根据命名实体的不同类别,使用不同工具的优势。尽管工具之间存在这种平衡,但我们得出结论指出了基于机器学习的工具可预见的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号