BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP

This is a BERT. Now there are several of them. Can they generalize to novel words?



Abstract

Recently, large-scale pre-trained neural network models such as BERT have achieved many state-of-the-art results in natural language processing. Recent work has explored the linguistic capacities of these models. However, no work has focused on the ability of these models to generalize these capacities to novel words. This type of generalization is exhibited by humans (Berko, 1958), and is intimately related to morphology: humans are in many cases able to identify inflections of novel words in the appropriate context. This type of morphological capacity has not been previously tested in BERT models, and is important for morphologically rich languages, which are under-studied in the literature on BERT's linguistic capacities. In this work, we investigate this by considering monolingual and multilingual BERT models' abilities to agree in number with novel plural words in English, French, German, Spanish, and Dutch. We find that many models are not able to reliably determine the plurality of novel words, suggesting potential deficiencies in the morphological capacities of BERT models.
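The agreement probe described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's actual code: the stimuli, the scoring function, and the accuracy metric are assumptions. It models the core comparison, where for each sentence with a nonce plural subject (e.g. "The wugs [MASK] here."), a masked language model's probability for the number-matching verb form is compared against the mismatched form.

```python
# Hedged sketch of a number-agreement probe (assumed methodology,
# not the paper's released code). Each example is a pair of
# probabilities a fill-mask model assigned to the matching vs.
# mismatched verb form for a sentence with a nonce plural subject.

def agreement_accuracy(scored_examples):
    """Fraction of examples where the verb form agreeing with the
    subject's number receives higher probability than the form
    that disagrees.

    scored_examples: list of (p_match, p_mismatch) pairs, e.g.
    probabilities for "are" vs. "is" in "The wugs [MASK] here."
    """
    correct = sum(
        1 for p_match, p_mismatch in scored_examples
        if p_match > p_mismatch
    )
    return correct / len(scored_examples)

# Toy probabilities (hypothetical values, not from a real BERT run):
examples = [
    (0.62, 0.21),  # "The wugs [are/is] here."     -> agreement wins
    (0.33, 0.40),  # "The blickets [are/is] loud." -> agreement fails
    (0.55, 0.10),
    (0.48, 0.47),
]
print(agreement_accuracy(examples))  # 0.75
```

In practice the probability pairs would come from querying a masked LM (for instance via a fill-mask interface) on each stimulus; chance performance is 0.5, so accuracy near that level would indicate the unreliable plurality judgments the abstract reports.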
