首页> 外文会议>International conference on computational linguistics >Language Independent Dependency to Constituent Tree Conversion
【24h】

Language Independent Dependency to Constituent Tree Conversion

机译:语言独立性对组成树转换的依赖性

获取原文

摘要

We present a dependency to constituent tree conversion technique that aims to improve constituent parsing accuracies by leveraging dependency treebanks available in a wide variety in many languages. The technique works in two steps. First, a partial constituent tree is derived from a dependency tree with a very simple deterministic algorithm that is both language and dependency type independent. Second, a complete high accuracy constituent tree is derived with a constraint-based parser, which uses the partial constituent tree as external constraints. Evaluated on Section 22 of the WSJ Treebank, the technique achieves the state-of-the-art conversion F-score 95.6. When applied to English Universal Dependency treebank and German CoNLL2006 treebank, the converted treebanks added to the human-annotated constituent parser training corpus improve parsing F-scores significantly for both languages.
机译:我们介绍了对成分树转换技术的依赖性,该技术旨在通过利用多种语言中可用的依赖性树库来提高成分分析的准确性。该技术分为两个步骤。首先,使用非常简单的确定性算法从依赖关系树派生部分组成树,该算法既独立于语言又依赖于依赖关系类型。其次,使用基于约束的解析器导出完整的高精度构成树,该解析器使用部分构成树作为外部约束。在《华尔街日报》树库的第22部分进行了评估,该技术实现了最新的转换F分数95.6。当应用于英语通用依赖树库和德国CoNLL2006树库时,转换后的树库添加到人工标注的成分分析器训练语料库中,可显着改善两种语言的F分数解析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号