One model, two languages: training bilingual parsers with harmonized treebanks

Abstract

We introduce an approach to train lexicalized parsers using bilingual corpora obtained by merging harmonized treebanks of different languages, producing parsers that can analyze sentences in either of the learned languages, or even sentences that mix both. We test the approach on the Universal Dependency Treebanks, training with MaltParser and MaltOptimizer. The results show that these bilingual parsers are more than competitive, as most combinations not only preserve accuracy, but some even achieve significant improvements over the corresponding monolingual parsers. Preliminary experiments also show the approach to be promising on texts with code-switching and when more languages are added.
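
The merging step mentioned in the abstract can be illustrated with a minimal sketch. It assumes both treebanks already share one annotation scheme and are stored in a CoNLL-style, tab-separated format with a blank line between sentences. The function names and the two toy sentences are hypothetical; the sketch does not reproduce the authors' pipeline or any MaltParser or MaltOptimizer invocation.

```python
def read_sentences(conll_text):
    """Split CoNLL-style text into sentence blocks (lists of token lines)."""
    sentences, current = [], []
    for line in conll_text.splitlines():
        if line.strip():
            current.append(line.rstrip())
        elif current:
            # A blank line closes the sentence collected so far.
            sentences.append(current)
            current = []
    if current:
        sentences.append(current)
    return sentences


def merge_treebanks(*conll_texts):
    """Concatenate the sentences of several treebanks into one training corpus."""
    merged = []
    for text in conll_texts:
        merged.extend(read_sentences(text))
    # Emit every sentence followed by one blank line.
    return "".join("\n".join(sent) + "\n\n" for sent in merged)


# Toy, hand-written fragments (assumption: not taken from any real treebank).
english = (
    "1\tShe\t_\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tsleeps\t_\tVERB\t_\t_\t0\troot\t_\t_\n"
)
spanish = (
    "1\tElla\t_\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tduerme\t_\tVERB\t_\t_\t0\troot\t_\t_\n"
)

bilingual = merge_treebanks(english, spanish)
print(len(read_sentences(bilingual)))  # number of sentences in the merged corpus
```

The merged string could then be written to a single file and used as the training corpus for one parser model, which is the general idea behind training on a bilingual corpus.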

Bibliographic information

Source
Annual Meeting of the Association for Computational Linguistics | 2016 | pp. 425-431 | 7 pages
Authors
David Vilares; Carlos Gomez-Rodriguez; Miguel A. Alonso
Original format
PDF

Similar literature

1. Bitext Dependency Parsing With Auto-Generated Bilingual Treebank [J]. Chen W., Kazama J., Zhang M. IEEE Transactions on Audio, Speech, and Language Processing, 2012, issue 5.
2. New treebank or repurposed? On the feasibility of cross-lingual parsing of Romance languages with Universal Dependencies [J]. Marcos Garcia, Carlos Gomez-Rodriguez, Miguel A. Alonso. Natural Language Engineering, 2018, issue pta1.
3. HamleDT: Harmonized multi-language dependency treebank [J]. Daniel Zeman, Ondrej Dusek, David Marecek. Language Resources and Evaluation, 2014, issue 4.
4. One model, two languages: training bilingual parsers with harmonized treebanks [C]. David Vilares, Carlos Gomez-Rodriguez, Miguel A. Alonso. Annual Meeting of the Association for Computational Linguistics, 2016.
5. Leveraging Training Data from High-Resource Languages to Improve Dependency Parsing for Low-Resource Languages [D]. Jaja, Claire. 2014.
6. Short-term language switching training tunes the neural correlates of cognitive control in bilingual language production [O]. Chunyan Kang, Yongben Fu, Junjie Wu. 2017.
7. One model, two languages: training bilingual parsers with harmonized treebanks [O]. Vilares, David, Gómez-Rodríguez, Carlos, Alonso, Miguel A. 2016.
