International Conference on Computational Linguistics

Fine-tuning BERT for Low-Resource Natural Language Understanding via Active Learning

Abstract

Recently, leveraging pre-trained Transformer-based language models in downstream, task-specific models has advanced the state of the art in natural language understanding tasks. However, little research has explored the suitability of this approach in low-resource settings with fewer than 1,000 training data points. In this work, we explore fine-tuning methods for BERT, a pre-trained Transformer-based language model, using pool-based active learning to speed up training while keeping the cost of labeling new data constant. Our experimental results on the GLUE data set show an advantage in model performance when queries from the pool of unlabeled data are selected to maximize the approximate knowledge gain of the model. Finally, we demonstrate and analyze the benefits of freezing layers of the language model during fine-tuning to reduce the number of trainable parameters, making it more suitable for low-resource settings.
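
The abstract does not spell out the query strategy, so the following is only a rough sketch of pool-based active learning around BERT fine-tuning, written against the Hugging Face transformers and PyTorch APIs. A predictive-entropy acquisition score stands in for the paper's approximate knowledge gain criterion, and the seed texts, pool texts, labels, and query budget are hypothetical placeholders, not the authors' setup.

```python
# Sketch: pool-based active learning around BERT fine-tuning.
# Entropy-based uncertainty is used as the acquisition function here; the
# paper instead maximizes an approximate knowledge gain. Data is illustrative.
import torch
from transformers import BertTokenizerFast, BertForSequenceClassification

device = "cuda" if torch.cuda.is_available() else "cpu"
tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
).to(device)

def encode(texts):
    return tokenizer(texts, padding=True, truncation=True,
                     max_length=128, return_tensors="pt")

@torch.no_grad()
def acquisition_scores(model, pool_texts, batch_size=32):
    """Predictive-entropy scores over the unlabeled pool (higher = more informative)."""
    model.eval()
    scores = []
    for i in range(0, len(pool_texts), batch_size):
        batch = encode(pool_texts[i:i + batch_size]).to(device)
        probs = torch.softmax(model(**batch).logits, dim=-1)
        entropy = -(probs * probs.clamp_min(1e-12).log()).sum(dim=-1)
        scores.append(entropy.cpu())
    return torch.cat(scores)

def fine_tune(model, texts, labels, epochs=3, lr=2e-5):
    """Fine-tune on the (small) labeled set as a single padded batch."""
    model.train()
    optimizer = torch.optim.AdamW(model.parameters(), lr=lr)
    enc = encode(texts).to(device)
    y = torch.tensor(labels, device=device)
    for _ in range(epochs):
        optimizer.zero_grad()
        loss = model(**enc, labels=y).loss
        loss.backward()
        optimizer.step()

# Active-learning loop: train on the labeled seed set, then repeatedly query the
# top-k most informative pool points, keeping the labeling budget per round fixed.
labeled_texts, labeled_labels = ["great movie"], [1]        # hypothetical seed data
pool_texts = ["terrible plot", "fine acting", "loved it"]   # hypothetical unlabeled pool
query_size, rounds = 2, 2

for _ in range(rounds):
    fine_tune(model, labeled_texts, labeled_labels)
    scores = acquisition_scores(model, pool_texts)
    top = scores.topk(min(query_size, len(pool_texts))).indices.tolist()
    for idx in sorted(top, reverse=True):
        labeled_texts.append(pool_texts.pop(idx))
        labeled_labels.append(0)  # in practice an annotator (the oracle) supplies this label
```

In a real run the label in the last line would come from a human annotator; fixing the per-round query size is what keeps the cost of labeling new data constant while the model is retrained on a growing labeled set.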
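Layer freezing, as mentioned in the last sentence of the abstract, can be sketched as below. The particular split (embeddings plus the first 8 of 12 encoder layers frozen, upper layers and the classification head trainable) is an arbitrary illustrative choice, not the configuration reported in the paper.

```python
# Sketch: freeze lower BERT layers during fine-tuning to cut trainable parameters.
from transformers import BertForSequenceClassification

model = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

# Freeze the embedding layer.
for param in model.bert.embeddings.parameters():
    param.requires_grad = False

# Freeze the first 8 of the 12 Transformer encoder layers; the top layers and
# the classification head remain trainable.
for layer in model.bert.encoder.layer[:8]:
    for param in layer.parameters():
        param.requires_grad = False

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable parameters: {trainable:,} / {total:,}")
```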
