首页> 外文会议>Conference on computational natural language learning >Delexicalized transfer parsing for low-resource languages using transformed and combined treebanks
【24h】

Delexicalized transfer parsing for low-resource languages using transformed and combined treebanks

机译:使用转换后的树库和合并后的树库对低资源语言进行非词性化传输解析

获取原文

摘要

This paper describes the ITT Kharagpur dependency parsing system in CoNLL-2017 shared task on Multilingual Parsing from Raw Text to Universal Dependencies. We primarily focus on the low-resource languages (surprise languages). We have developed a framework to combine multiple treebanks to train parsers for low resource languages by a delexicalization method. We have applied transformation on the source language treebanks based on syntactic features of the low-resource language to improve performance of the parser. In the official evaluation, our system achieves macro-averaged LAS scores of 67.61 and 37.16 on the entire blind test data and the surprise language test data respectively.
机译:本文介绍了CoNLL-2017共享任务中从原始文本到通用依赖项的多语言解析中的ITT Kharagpur依赖项解析系统。我们主要关注资源匮乏的语言(惊奇语言)。我们已经开发了一个框架,可以通过去词化方法结合多个树库来训练低资源语言的解析器。我们基于低资源语言的语法特征对源语言树库进行了转换,以提高解析器的性能。在官方评估中,我们的系统在整个盲测数据和惊奇语言测试数据上分别获得67.61和37.16的宏观平均LAS分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号