首页> 外文会议>International Conference on Text, Speech and Dialogue >Detecting and Correcting Errors in an English Tectogrammatical Annotation
【24h】

Detecting and Correcting Errors in an English Tectogrammatical Annotation

机译:检测和纠正英语构造中的错误

获取原文

摘要

We present our first experiments with detecting and correcting errors in a manual annotation of English texts, taken from the Penn Treebank, at the dependency-based tectogrammatical layer, as it is defined in the Prague Dependency Treebank. The main idea is that errors in the annotation usually result in an inconsistency, i.e. the state when a phenomenon is annotated in different ways at several places in a corpus. We describe our algorithm for detecting inconsistencies (it got positive feedback from annotators) and we present some statistics on the manually corrected data and results of a tectogrammatical analyzer which uses these data for its operation. The corrections have improved the data just slightly so far, but we outline some ways to more significant improvement.
机译:我们在基于依赖性的构造图层中,我们介绍了检测和纠正了从Penn TreeBank的英语文本的手动注释中的错误,因为它在Prague依赖性树班内定义。主要思想是注释中的错误通常会导致不一致的,即,当一个现象在语料库中的几个地方以不同方式注释时的状态。我们描述了我们用于检测不一致的算法(它获得了注释器的正面反馈),我们向手动纠正的数据和特图标准分析仪的结果提供了一些统计数据,该分析器使用这些数据进行操作。到目前为止,更正的数据略微改进了数据,但我们概述了一些更加重大改进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号