Conference on Empirical Methods in Natural Language Processing

From Zero to Hero: On the Limitations of Zero-Shot Language Transfer with Multilingual Transformers



Abstract

Massively multilingual transformers (MMTs) pretrained via language modeling (e.g., mBERT, XLM-R) have become a default paradigm for zero-shot language transfer in NLP, offering unmatched transfer performance. Current evaluations, however, verify their efficacy in transfers (a) to languages with sufficiently large pretraining corpora, and (b) between close languages. In this work, we analyze the limitations of downstream language transfer with MMTs, showing that, much like cross-lingual word embeddings, they are substantially less effective in resource-lean scenarios and for distant languages. Our experiments, encompassing three lower-level tasks (POS tagging, dependency parsing, NER) and two high-level tasks (NLI, QA), empirically correlate transfer performance with linguistic proximity between source and target languages, but also with the size of target language corpora used in MMT pretraining. Most importantly, we demonstrate that the inexpensive few-shot transfer (i.e., additional fine-tuning on a few target-language instances) is surprisingly effective across the board, warranting more research efforts reaching beyond the limiting zero-shot conditions.
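The few-shot recipe the abstract describes (fine-tuning on source-language data, then continuing fine-tuning on a handful of target-language instances) is straightforward to sketch. Below is a minimal illustration, assuming XLM-R via the Hugging Face transformers library and a sequence-classification task; the toy data, label scheme, and hyperparameters are hypothetical stand-ins, not the paper's actual experimental setup.

```python
import torch
from torch.optim import AdamW
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_NAME = "xlm-roberta-base"  # one of the MMTs studied in the paper
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME, num_labels=3)

def fine_tune(texts, labels, epochs=3, lr=2e-5):
    """One fine-tuning pass; the same routine serves both transfer stages."""
    optimizer = AdamW(model.parameters(), lr=lr)
    model.train()
    for _ in range(epochs):
        batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
        loss = model(**batch, labels=torch.tensor(labels)).loss
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

# Hypothetical toy data standing in for real source/target corpora.
source_texts  = ["The movie was great.", "Terrible plot.", "It was okay."]
source_labels = [2, 0, 1]
target_texts  = ["Der Film war großartig.", "Schreckliche Handlung."]
target_labels = [2, 0]

# Stage 1: fine-tune on (plentiful) source-language data.
# Stopping here yields the standard zero-shot transfer model.
fine_tune(source_texts, source_labels)

# Stage 2 (few-shot): briefly continue fine-tuning on a few target-language
# instances -- the inexpensive step the paper finds effective across the board.
fine_tune(target_texts, target_labels, epochs=10)
```

The key point of the second stage is that it reuses the already fine-tuned model rather than training anew, so its cost amounts to a few extra gradient steps on a handful of labeled target-language examples.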
