Conference of the European Chapter of the Association for Computational Linguistics

Maximal Multiverse Learning for Promoting Cross-Task Generalization of Fine-Tuned Language Models



Abstract

Language modeling with BERT consists of two phases of (i) unsupervised pre-training on unlabeled text, and (ii) fine-tuning for a specific supervised task. We present a method that leverages the second phase to its fullest, by applying an extensive number of parallel classifier heads, which are enforced to be orthogonal, while adaptively eliminating the weaker heads during training. We conduct an extensive inter- and intra-dataset evaluation, showing that our method improves the generalization ability of BERT, sometimes leading to a +9% gain in accuracy. These results highlight the importance of a proper fine-tuning procedure, especially for relatively smaller-sized datasets. Our code is attached as supplementary.
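The abstract describes the core mechanism: many parallel classifier heads trained on top of the shared encoder, constrained to be mutually orthogonal, with the weaker heads eliminated during training. The following is a minimal PyTorch sketch of that idea, not the authors' implementation; the module name, the penalty weight of 0.1, and the random stand-in for BERT's pooled output are illustrative assumptions.

```python
import torch
import torch.nn as nn


class MultiverseHeads(nn.Module):
    """K parallel linear classifier heads over a shared pooled representation."""

    def __init__(self, hidden_size: int, num_labels: int, num_heads: int):
        super().__init__()
        self.heads = nn.ModuleList(
            [nn.Linear(hidden_size, num_labels) for _ in range(num_heads)]
        )

    def forward(self, pooled: torch.Tensor) -> torch.Tensor:
        # pooled: (batch, hidden) -> logits: (num_heads, batch, num_labels)
        return torch.stack([head(pooled) for head in self.heads])

    def orthogonality_penalty(self) -> torch.Tensor:
        # Penalize pairwise overlap between the heads' normalized, flattened
        # weight matrices so different heads learn different decision directions.
        W = torch.stack([h.weight.flatten() for h in self.heads])   # (K, d)
        W = nn.functional.normalize(W, dim=1)
        gram = W @ W.t()                                            # (K, K)
        off_diag = gram - torch.eye(len(self.heads), device=W.device)
        return off_diag.pow(2).sum()


# Usage: per-head cross-entropy averaged over heads, plus the orthogonality term.
heads = MultiverseHeads(hidden_size=768, num_labels=2, num_heads=8)
pooled = torch.randn(4, 768)           # stand-in for BERT's pooled [CLS] output
labels = torch.randint(0, 2, (4,))
logits = heads(pooled)                 # (8, 4, 2)
ce = torch.stack(
    [nn.functional.cross_entropy(l, labels) for l in logits]
).mean()
loss = ce + 0.1 * heads.orthogonality_penalty()
loss.backward()
```

The adaptive elimination step described in the abstract would sit on top of this loop, for example by periodically dropping the heads with the weakest held-out performance; that logic is omitted from the sketch.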
