Conference of the European Chapter of the Association for Computational Linguistics

Multilingual LAMA: Investigating Knowledge in Multilingual Pretrained Language Models


Abstract

Recently, it has been found that monolingual English language models can be used as knowledge bases. Instead of structural knowledge base queries, masked sentences such as "Paris is the capital of [MASK]" are used as probes. We translate the established benchmarks TREx and GoogleRE into 53 languages. Working with mBERT, we investigate three questions. (i) Can mBERT be used as a multilingual knowledge base? Most prior work only considers English. Extending research to multiple languages is important for diversity and accessibility. (ii) Is mBERT's performance as a knowledge base language-independent, or does it vary from language to language? (iii) A multilingual model is trained on more text, e.g., mBERT is trained on 104 Wikipedias. Can mBERT leverage this for better performance? We find that using mBERT as a knowledge base yields varying performance across languages, and pooling predictions across languages improves performance. Conversely, mBERT exhibits a language bias; e.g., when queried in Italian, it tends to predict Italy as the country of origin.
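The cloze-style probing the abstract describes can be reproduced at a small scale with a fill-mask query against mBERT. Below is a minimal sketch, assuming the Hugging Face transformers library and the bert-base-multilingual-cased checkpoint; the two probe sentences are illustrative examples, not the paper's translated TREx/GoogleRE templates.

```python
# Minimal sketch of masked-sentence probing with mBERT (not the authors' code).
# The probe sentences below are illustrative assumptions, not the paper's templates.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-multilingual-cased")

probes = [
    "Paris is the capital of [MASK].",     # English probe
    "Parigi è la capitale della [MASK].",  # Italian probe (illustrative translation)
]

for probe in probes:
    print(probe)
    # Each prediction carries the token filled into [MASK] and its probability.
    for prediction in fill_mask(probe, top_k=3):
        print(f"  {prediction['token_str']!r}  score={prediction['score']:.3f}")
```

Comparing the top predictions for the English and Italian probes gives a rough, qualitative view of the language bias noted above; the paper's actual evaluation uses the benchmarks translated into 53 languages and pools predictions across them.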
