Journal of Machine Learning Research

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer


Abstract

Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on a downstream task, has emerged as a powerful technique in natural language processing (NLP). The effectiveness of transfer learning has given rise to a diversity of approaches, methodology, and practice. In this paper, we explore the landscape of transfer learning techniques for NLP by introducing a unified framework that converts all text-based language problems into a text-to-text format. Our systematic study compares pre-training objectives, architectures, unlabeled data sets, transfer approaches, and other factors on dozens of language understanding tasks. By combining the insights from our exploration with scale and our new “Colossal Clean Crawled Corpus”, we achieve state-of-the-art results on many benchmarks covering summarization, question answering, text classification, and more. To facilitate future work on transfer learning for NLP, we release our data set, pre-trained models, and code.
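
The unified format the abstract describes is concrete: every task, whether translation, classification, regression, or summarization, is given to the model as plain input text (marked with a short task prefix) and the model is trained to emit plain target text. Below is a minimal sketch of that conversion in plain Python, using task prefixes and example targets drawn from the paper's own illustrations; the to_text_to_text helper, its name, and its dict field names are hypothetical, introduced here only for illustration and not part of the released code.

    # A minimal sketch of the text-to-text conversion, assuming nothing
    # beyond the standard library. Prefixes and targets follow the paper's
    # examples; the helper itself is illustrative, not from the release.

    def to_text_to_text(task, example):
        """Cast a labeled example into an (input_text, target_text) pair."""
        if task == "translate_en_de":
            return ("translate English to German: " + example["en"],
                    example["de"])
        if task == "cola":
            # Classification: the label itself is emitted as literal text.
            label = "acceptable" if example["label"] == 1 else "not acceptable"
            return ("cola sentence: " + example["sentence"], label)
        if task == "stsb":
            # Regression: the similarity score is emitted as a number string.
            inp = ("stsb sentence1: " + example["sentence1"]
                   + " sentence2: " + example["sentence2"])
            return (inp, "{:.1f}".format(example["score"]))
        if task == "summarize":
            return ("summarize: " + example["document"], example["summary"])
        raise ValueError("unknown task: " + task)

    # With every task in this shape, one encoder-decoder model, one
    # maximum-likelihood training objective, and one decoding procedure
    # cover all of them.
    pairs = [
        to_text_to_text("translate_en_de",
                        {"en": "That is good.", "de": "Das ist gut."}),
        to_text_to_text("cola",
                        {"sentence": "The course is jumping well.",
                         "label": 0}),
        to_text_to_text("stsb",
                        {"sentence1": "The rhino grazed on the grass.",
                         "sentence2": "A rhino is grazing in a field.",
                         "score": 3.8}),
    ]
    for inp, tgt in pairs:
        print("input:  " + inp)
        print("target: " + tgt)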