首页> 外文会议>International conference on language resources and evaluation >Further Developments in Treebank Error Detection Using Derivation Trees
【24h】

Further Developments in Treebank Error Detection Using Derivation Trees

机译:使用派生树在树库错误检测中的进一步发展

获取原文

摘要

This work describes how derivation tree fragments based on a variant of Tree Adjoining Grammar (TAG) can be used to check treebank consistency. Annotation of word sequences are compared both for their internal structural consistency, and their external relation to the rest of the tree. We expand on earlier work in this area in three ways. First, we provide a more complete description of the system, showing how a naive use of TAG structures will not work, leading to a necessary refinement. We also provide a more complete account of the processing pipeline, including the grouping together of structurally similar errors and their elimination of duplicates. Second, we include the new experimental external relation check to find an additional class of errors. Third, we broaden the evaluation to include both the internal and external relation checks, and evaluate the system on both an Arabic and English treebank. The evaluation has been successful enough that the internal check has been integrated into the standard pipeline for current English treebank construction at the Linguistic Data Consortium.
机译:这项工作描述了如何使用基于树邻接语法(TAG)的派生树片段来检查树库一致性。比较单词序列的注释的内部结构一致性,以及它们与树的其余部分的外部关系。我们通过三种方式扩展了这方面的早期工作。首先,我们提供了对该系统的更完整描述,显示了如何简单地使用TAG结构,从而导致必要的改进。我们还提供了更完整的处理流程说明,包括将结构相似的错误归为一类,并消除了重复项。其次,我们包括新的实验性外部关系检查,以查找其他类别的错误。第三,我们将评估范围扩大到包括内部和外部关系检查,并在阿拉伯树和英语树库上评估该系统。评估已经足够成功,内部语言检查已集成到语言数据协会当前英语树库建设的标准管道中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号