首页> 外文会议>Annual meeting of the Association for Computational Linguistics >One model, two languages: training bilingual parsers with harmonized treebanks
【24h】

One model, two languages: training bilingual parsers with harmonized treebanks

机译:一种模型,两种语言:使用统一的树库训练双语解析器

获取原文

摘要

We introduce an approach to train lexical-ized parsers using bilingual corpora obtained by merging harmonized treebanks of different languages, producing parsers that can analyze sentences in either of the learned languages, or even sentences that mix both. We test the approach on the Universal Dependency Treebanks, training with MaltParser and MaltOpti-mizer. The results show that these bilingual parsers are more than competitive, as most combinations not only preserve accuracy, but some even achieve significant improvements over the corresponding monolingual parsers. Preliminary experiments also show the approach to be promising on texts with code-switching and when more languages are added.
机译:我们引入一种使用双语语料库训练词汇化解析器的方法,该语料库是通过合并不同语言的和谐树库而获得的,生成的解析器可以分析两种学习语言中的句子,甚至可以分析两种语言的句子。我们在MaltParser和MaltOpti-mizer的培训下,在通用依赖树库上测试了该方法。结果表明,这些双语解析器比竞争产品更具竞争优势,因为大多数组合不仅保留了准确性,而且某些组合甚至比相应的单语解析器有了显着改进。初步实验还表明,这种方法在带有代码切换功能的文本以及添加更多语言时很有希望。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号