首页> 外文会议>International conference on language resources and evaluation >AnIta: a powerful morphological analyser for Italian
【24h】

AnIta: a powerful morphological analyser for Italian

机译:Anita:一种强大的形态学分析仪,适用于意大利语

获取原文

摘要

In this paper we present Anita, a powerful morphological analyser for Italian implemented within the framework of finite-state-automata models. It is provided by a large lexicon containing more than 110,000 lemmas that enable it to cover relevant portions of Italian texts. We describe our design choices for the management of inflectional phenomena as well as some interesting new features to explicitly handle derivational and compositional processes in Italian, namely the wordform segmentation structure and Derivation Graph. Two different evaluation experiments, for testing coverage (Recall) and Precision, are described in detail, comparing the Anita performances with some other freely available tools to handle Italian morphology. The experiments results show that the Anita Morphological Analyser obtains the best performances among the tested systems, with Recall = 97.21% and Precision = 98.71%. This tool was a fundamental building block for designing a performant PoS-tagger and Lemmatiser for the Italian language that participated to two EVALITA evaluation campaigns ranking, in both cases, together with the best performing systems.
机译:在本文中,我们展示了Anita,一种强大的形态学分析仪,用于意大利人在有限状态 - 自动机模型框架内实施。它由含有超过110,000个lemmas的大型lexicon提供,使其能够涵盖意大利文本的相关部分。我们描述了我们的设计选择,以管理拐点现象以及一些有趣的新功能,明确地处理意大利语中的衍生和组成过程,即字形分割结构和推导图。详细描述了两个不同的评估实验,用于测试覆盖(召回)和精度,将Anita表演与其他一些可自由的工具进行比较以处理意大利形态。实验结果表明,Anita形态分析仪在测试系统中获得了最佳性能,召回= 97.21%和精度= 98.71%。该工具是设计一个基本的构建块,用于为意大利语言设计一个表演POS-Tagger和LEMMATISER,这些语言参与两种案例在两种情况下都参与了两种情况,以及最好的执行系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号