首页> 外文会议>Language processing and intelligent information systems >Automatic Detection of Annotation Errors in Polish-Language Corpora
【24h】

Automatic Detection of Annotation Errors in Polish-Language Corpora

机译:波兰语语料库中注释错误的自动检测

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this article we propose an extension to the variation n-gram based method of detecting annotation errors. We also show an approach to finding anomalies in the morphosyntactic annotation layer by using association rule discovery. As no research has previously been done in the field of morphosyntactic annotation error correction for Polish, we provide novel results based on experiments on the largest available Polish language corpus, the National Corpus of Polish (NCP). We also discuss the differences in the approaches used earlier for English language data and the method proposed in this article, taking into account the characteristics of Polish language.
机译:在本文中,我们提出了对基于变体n-gram的检测注释错误的方法的扩展。我们还展示了一种通过使用关联规则发现来在句法注释层中查找异常的方法。由于以前在波兰语的句法语法注释错误校正领域尚未进行任何研究,因此我们基于最大的波兰语语料库,即波兰国家语料库(NCP)的实验,提供了新颖的结果。考虑到波兰语的特点,我们还将讨论早期用于英语数据的方法和本文提出的方法的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号