Workshop on Domain Adaptation for NLP

Analyzing the Domain Robustness of Pretrained Language Models, Layer by Layer


Abstract

The robustness of pretrained language models (PLMs) is generally measured by performance drops across two or more domains. However, we do not yet understand the inherent robustness contributed by the different layers of a PLM. We systematically analyze the robustness of these representations layer by layer from two perspectives. First, we measure the robustness of representations using the domain divergence between two domains. We find that (i) domain variance increases from the lower to the upper layers of vanilla PLMs; (ii) models continuously pretrained on domain-specific data (DAPT; Gururangan et al., 2020) exhibit more variance than their pretrained PLM counterparts; and (iii) distilled models (e.g., DistilBERT) also show greater domain variance. Second, we investigate the robustness of representations by analyzing the encoded syntactic and semantic information using diagnostic probes. We find that similar layers carry similar amounts of linguistic information for data from an unseen domain.
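The layer-by-layer measurement described above can be sketched with a toy example. The abstract does not specify the exact divergence measure, so the sketch below assumes a simple mean-embedding (linear-MMD-style) distance between the per-layer hidden states of two domains; the data, layer count, and widening domain shift are synthetic and purely illustrative.

```python
import numpy as np

def layer_domain_divergence(reps_a: np.ndarray, reps_b: np.ndarray) -> float:
    """Mean-embedding divergence between two domains' layer representations.

    reps_a, reps_b: arrays of shape (n_examples, hidden_dim), e.g. pooled
    hidden states of one PLM layer for texts from each domain.
    """
    return float(np.linalg.norm(reps_a.mean(axis=0) - reps_b.mean(axis=0)))

# Hypothetical per-layer hidden states for two domains (synthetic data).
rng = np.random.default_rng(0)
n_examples, hidden_dim, n_layers = 100, 768, 4

divergences = []
for layer in range(n_layers):
    shift = 0.1 * layer  # toy assumption: upper layers drift further apart
    reps_domain_a = rng.normal(0.0, 1.0, size=(n_examples, hidden_dim))
    reps_domain_b = rng.normal(shift, 1.0, size=(n_examples, hidden_dim))
    divergences.append(layer_domain_divergence(reps_domain_a, reps_domain_b))

print(divergences)
```

In this toy setup the divergence grows with layer depth, mirroring the paper's finding (i) that domain variance increases from lower to upper layers; with a real PLM one would instead pool the hidden states returned for each layer on actual text from the two domains.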