【24h】

SyntaxNet Errors from the Linguistic Point of View

机译:语言观点的语法错误

获取原文

摘要

The paper deals with Google's universal parser SyntaxNet. The system was used to analyze the Universal Dependencies linguistic corpora. We conducted an error analysis of the output of the parser to reveal to what extent the error types are connected with or preconditioned by the language types. In particular, we carried out several experiments, clustering the languages based on the frequency of different errors made by SyntaxNet, and studied the similarity of the resulting clustering with the traditional typology of languages. Three types of errors were separately considered: part-of-speech tagging, dependency labeling, and attachment errors. We show that there is indeed a correlation between error frequencies and language types, which might indicate that to further improve the performance of a universal parser, one needs to take into account language-specific morphological and syntactic structures.
机译:本文处理了Google的通用Parser语法。该系统用于分析普遍依赖性语言信息。我们对解析器的输出进行了一个错误分析,以揭示错误类型与语言类型的错误类型或预先处理的程度。特别是,我们进行了几个实验,基于SyntaxNet所做的不同误差的频率聚类语言,并研究了由传统语言的传统类型学结果聚类的相似性。单独考虑三种类型的错误:词语兼容标记,依赖标记和附件错误。我们表明,错误频率和语言类型之间存在相关性,这可能表明可以进一步提高通用解析器的性能,因此需要考虑语言特定的形态和句法结构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号