首页> 外文会议>9th International conference on language resources and evaluation >An Out-of-Domain Test Suite for Dependency Parsing of German
【24h】

An Out-of-Domain Test Suite for Dependency Parsing of German

机译:用于依赖项分析的域外测试套件

获取原文

摘要

We present a dependency conversion of five German test sets from five different genres. The dependency representation is made as similar as possible to the dependency representation of TiGer, one of the two big syntactic treebanks of German. The purpose of these test sets is to enable researchers to test dependency parsing models on several different data sets from different text genres. We discuss some easy to compute statistics to demonstrate the variation and differences in the test sets and provide some baseline experiments where we test the effect of additional lexical knowledge on the out-of-domain performance of two state-of-the-art dependency parsers. Finally, we demonstrate with three small experiments that text normalization may be an important step in the standard processing pipeline when applied in an out-of-domain setting.
机译:我们提出了五个不同流派的五个德国测试集的依存关系转换。依赖项表示与TiGer(德语的两个大语法树库之一)的依赖项表示尽可能相似。这些测试集的目的是使研究人员能够测试来自不同文本类型的多个不同数据集上的依赖项解析模型。我们讨论了一些易于计算的统计数据,以证明测试集中的变化和差异,并提供一些基线实验,在这些实验中我们测试了附加词汇知识对两个最新的依赖解析器的域外性能的影响。最后,我们通过三个小实验证明,将文本规范化应用到域外设置中可能是标准处理管道中的重要步骤。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号