Annual Meeting of the Association for Computational Linguistics

BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Model Performance

Abstract

Pretraining deep language models has led to large performance gains in NLP. Despite this success, Schick and Schuetze (2020) recently showed that these models struggle to understand rare words. For static word embeddings, this problem has been addressed by separately learning representations for rare words. In this work, we transfer this idea to pretrained language models: We introduce BERTRAM, a powerful architecture based on BERT that is capable of inferring high-quality embeddings for rare words that are suitable as input representations for deep language models. This is achieved by enabling the surface form and contexts of a word to interact with each other in a deep architecture. Integrating BERTRAM into BERT leads to large performance increases due to improved representations of rare and medium frequency words on both a rare word probing task and three downstream tasks.

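The abstract's key mechanism is letting a word's surface form and the contexts it occurs in interact within a deep architecture, producing an embedding that can serve as an input representation for a pretrained model. The PyTorch sketch below illustrates that general idea only; it is not the authors' implementation, and the n-gram vocabulary size, the attention-based form-context interaction, and the gated combination are all illustrative assumptions.

```python
# Minimal sketch of the form-context idea behind BERTRAM (not the authors'
# code). A rare word's surface form and a set of context embeddings interact
# to produce a single input embedding. Dimensions and modules are assumptions.

import torch
import torch.nn as nn


class FormContextFuser(nn.Module):
    """Fuses a surface-form embedding with embeddings of the contexts in
    which a rare word occurs, letting the two interact via attention."""

    def __init__(self, dim: int = 768, n_heads: int = 4):
        super().__init__()
        # Surface form: mean of character-n-gram embeddings (FastText-style).
        # The 50k n-gram vocabulary size is an arbitrary placeholder.
        self.ngram_emb = nn.EmbeddingBag(num_embeddings=50_000, embedding_dim=dim)
        # Deep interaction: the form embedding attends over context embeddings.
        self.attn = nn.MultiheadAttention(dim, n_heads, batch_first=True)
        # Scalar gate deciding how much to trust form vs. context information.
        self.gate = nn.Linear(2 * dim, 1)

    def forward(self, ngram_ids: torch.Tensor,
                context_embs: torch.Tensor) -> torch.Tensor:
        # ngram_ids:     (batch, n_ngrams) ids of the word's character n-grams
        # context_embs:  (batch, n_contexts, dim), e.g. one vector per context
        #                sentence obtained by encoding it with BERT
        form = self.ngram_emb(ngram_ids)              # (batch, dim)
        query = form.unsqueeze(1)                     # (batch, 1, dim)
        ctx, _ = self.attn(query, context_embs, context_embs)
        ctx = ctx.squeeze(1)                          # (batch, dim)
        # Gated combination of surface-form and context evidence.
        alpha = torch.sigmoid(self.gate(torch.cat([form, ctx], dim=-1)))
        return alpha * form + (1 - alpha) * ctx       # (batch, dim)


if __name__ == "__main__":
    fuser = FormContextFuser()
    ngram_ids = torch.randint(0, 50_000, (2, 6))  # 6 n-grams per word
    context_embs = torch.randn(2, 10, 768)        # 10 contexts per word
    emb = fuser(ngram_ids, context_embs)
    print(emb.shape)  # torch.Size([2, 768])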