International Conference on Text, Speech, and Dialogue

FinEst BERT and CroSloEngual BERT: Less Is More in Multilingual Models


Abstract

Large pretrained masked language models have become state-of-the-art solutions for many NLP problems. Research has mostly focused on the English language, however. While massively multilingual models exist, studies have shown that monolingual models produce much better results. We train two trilingual BERT-like models, one for Finnish, Estonian, and English, the other for Croatian, Slovenian, and English. We evaluate their performance on several downstream tasks (NER, POS-tagging, and dependency parsing), using multilingual BERT and XLM-R as baselines. The newly created FinEst BERT and CroSloEngual BERT improve the results on all tasks in most monolingual and cross-lingual situations.
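
The evaluation described above amounts to fine-tuning each pretrained encoder with a task-specific head (e.g. token classification for NER) and comparing it against multilingual BERT and XLM-R. As a minimal illustration only, the sketch below shows how such a model could be loaded with the Hugging Face transformers library; the EMBEDDIA/* hub identifiers and the label count are assumptions made for the example, not details taken from the abstract.

```python
# A rough sketch, not the authors' evaluation code. It assumes the trilingual
# models are published on the Hugging Face Hub under the EMBEDDIA organization;
# the hub identifiers and the NER label count are illustrative assumptions.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

MODEL_IDS = {
    "CroSloEngual BERT": "EMBEDDIA/crosloengual-bert",    # assumed hub id
    "FinEst BERT": "EMBEDDIA/finest-bert",                # assumed hub id
    "multilingual BERT": "bert-base-multilingual-cased",  # baseline in the paper
}

def load_for_ner(model_id: str, num_labels: int = 9):
    """Load a tokenizer and an (untrained) token-classification head for NER."""
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForTokenClassification.from_pretrained(
        model_id, num_labels=num_labels
    )
    return tokenizer, model

if __name__ == "__main__":
    tokenizer, model = load_for_ner(MODEL_IDS["CroSloEngual BERT"])
    batch = tokenizer("Ljubljana je glavno mesto Slovenije.", return_tensors="pt")
    with torch.no_grad():
        logits = model(**batch).logits
    # Shape: (batch, sequence_length, num_labels); the classification head must
    # still be fine-tuned on an NER corpus before its predictions are meaningful.
    print(logits.shape)
```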
