LemPORT: a High-Accuracy Cross-Platform Lemmatizer for Portuguese

Abstract

Although lemmatization is a common subtask in many natural language processing tasks, few truly cross-platform lemmatization tools specifically target Portuguese, in particular ones that can be integrated into projects developed in Java. To address this, we developed a lemmatizer, initially only for our own use, and have since decided to make it publicly available. The lemmatizer presented in this document achieves an overall accuracy above 98% when evaluated against a manually revised corpus.