Conference: 9th International Conference on Language Resources and Evaluation

Validation Issues Induced by an Automatic Pre-Annotation Mechanism in the Building of Non-projective Dependency Treebanks

Abstract

Building large dependency treebanks with the CDG Lab, a grammar-based dependency treebank development tool, usually requires the annotator to fill in a selection form before parsing. This step is usually necessary because, without it, the search space for long sentences is too large and the parser fails to produce even one solution. With the information the annotator provides on the selection form, the parser can produce one or several dependency structures; the annotator then adds positive or negative annotations on dependencies and relaunches the parser iteratively until the right dependency structure is found. However, the selection form is sometimes difficult and time-consuming to fill in, because the annotator must have an idea of the result before parsing. The CDG Lab proposes to replace this form with an automatic pre-annotation mechanism. However, this mechanism introduces issues during the annotation phase that do not arise when the annotator uses a selection form. The article presents those issues and proposes modifications to the CDG Lab so that the automatic pre-annotation mechanism can be used effectively.