Revealing the Dark Secrets of BERT

机译：揭示BERT的黑暗秘密

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

BERT-based architectures currently give state-of-the-art performance on many NLP tasks, but little is known about the exact mechanisms that contribute to its success. In the current work, we focus on the interpretation of self-attention, which is one of the fundamental underlying components of BERT. Using a subset of GLUE tasks and a set of handcrafted features-of-interest, we propose the methodology and carry out a qualitative and quantitative analysis of the information encoded by the individual BERT's heads. Our findings suggest that there is a limited set of attention patterns that are repeated across different heads, indicating the overall model overparametriza-tion. While different heads consistently use the same attention patterns, they have varying impact on performance across different tasks. We show that manually disabling attention in certain heads leads to a performance improvement over the regular fine-tuned BERT models.

机译：目前，基于BERT的体系结构可在许多NLP任务上提供最先进的性能，但对于促成其成功的确切机制知之甚少。在当前的工作中，我们专注于自我注意力的解释，这是BERT的基本基础组成部分之一。我们使用GLUE任务的子集和一组手工制作的感兴趣功能，提出了该方法，并对由各个BERT头编码的信息进行了定性和定量分析。我们的发现表明，在不同的头脑中重复出现的注意力模式非常有限，这表明整个模型的参数设置过高。尽管不同的负责人始终使用相同的注意力模式，但它们对不同任务的绩效产生不同的影响。我们表明，与常规的微调BERT模型相比，手动禁用某些头部的注意力可以提高性能。

著录项

来源
《International joint conference on natural language processing;Conference on empirical methods in natural language processing》|2019年|4364-4373|共10页
会议地点
作者
Olga Kovaleva; Alexey Romanov; Anna Rogers; Anna Rumshisky;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Dark Matter's secret liaisons: phenomenology of a dark U(1) sector with bound states [J] . Cirelli Marco, Panci Paolo, Petraki Kalliopi, Journal of Cosmology and Astroparticle Physics . 2017,第5期

机译：暗物质的秘密联络：带有界定的黑暗U（1）个行业的现象学
2. DARK SECRETS, DARK SKIES [J] . Warren Julian Lighting . 2013,第6期

机译：黑暗的秘密，黑暗的天空
3. Revealing the secrets of Qatar Airways' success [J] . Strategic Direction . 2021,第3期

机译：揭示卡塔尔航空公司成功的秘密
4. Revealing the Dark Secrets of BERT [C] . Olga Kovaleva, Alexey Romanov, Anna Rogers, International joint conference on natural language processing . 2019

机译：揭示伯特的黑暗秘密
5. Revealing Victoria's Secret: A hermeneutic exploration of female New Luxury consumers. [D] . Granot, Elad. 2006

机译：揭示维多利亚的秘密：对女性“新奢侈品”消费者的诠释性探索。
6. The Stem Cell Revolution Revealing Protozoan Parasites’ Secrets and Paving the Way towards Vaccine Development [O] . Alena Pance 2021

机译：干细胞革命揭示了原生动物寄生虫的秘密铺平了疫苗发展的途径
7. André Brochu : Anne Hébert. Le secret de vie et de mort, Ottawa, Les Presses de l’Université d’Ottawa, « Oeuvres et auteurs », 2000, 284 p. [O] . Everett, Jane 2000

机译：andréBrochu：anneHébert。生死的秘密，渥太华，渥太华大学出版社，“作品和作者”，2000年，284页。

Revealing the Dark Secrets of BERT

摘要

著录项

相似文献

相关主题

期刊订阅