Workshop on Domain Adaptation for NLP
BERTologiCoMix: How does Code-Mixing interact with Multilingual BERT?



Abstract

Models such as mBERT and XLMR have shown success in solving Code-Mixed NLP tasks even though they were not exposed to such text during pretraining. Code-Mixed NLP models have relied on using synthetically generated data along with naturally occurring data to improve their performance. Fine-tuning mBERT on such data improves its code-mixed performance, but the benefits of using the different types of Code-Mixed data aren't clear. In this paper, we study the impact of fine-tuning with different types of code-mixed data and outline the changes that occur to the model during such fine-tuning. Our findings suggest that using naturally occurring code-mixed data brings in the best performance improvement after fine-tuning, and that fine-tuning with any type of code-mixed text improves the responsivity of its attention heads to code-mixed text inputs.
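The abstract describes fine-tuning mBERT on code-mixed text for downstream tasks. Below is a minimal, illustrative sketch of such a fine-tuning step using the Hugging Face transformers library; the example sentences, label scheme, and hyperparameters are hypothetical placeholders and not the authors' actual data or setup.

```python
# Illustrative sketch only: fine-tuning multilingual BERT on (hypothetical)
# code-mixed examples for a binary classification task.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-multilingual-cased", num_labels=2  # placeholder label count
)

# Hypothetical Hindi-English code-mixed sentences with placeholder labels.
texts = [
    "yeh movie bahut boring thi, total waste of time",
    "kya amazing performance tha, loved it!",
]
labels = torch.tensor([0, 1])

batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)

model.train()
for _ in range(3):  # a few gradient steps, purely for illustration
    outputs = model(**batch, labels=labels)
    outputs.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```

In practice, the same loop would run over a full code-mixed corpus (naturally occurring or synthetically generated) with a proper DataLoader; the paper's comparison concerns which kind of code-mixed data is used for this fine-tuning, not the training loop itself.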
