Cloze-driven Pretraining of Self-attention Networks

Abstract

We present a new approach for pretraining a bi-directional transformer model that provides significant performance gains across a variety of language understanding problems. Our model solves a cloze-style word reconstruction task, where each word is ablated and must be predicted given the rest of the text. Experiments demonstrate large performance gains on GLUE and new state-of-the-art results on NER as well as constituency parsing benchmarks, consistent with BERT. We also present a detailed analysis of a number of factors that contribute to effective pretraining, including data domain and size, model capacity, and variations on the cloze objective.
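To make the cloze objective concrete, the sketch below ablates each word in turn and trains a bidirectional transformer encoder to reconstruct it from the surrounding context. This is a minimal PyTorch approximation for illustration only, not the authors' exact two-tower (forward/backward) architecture; the vocabulary size, the reserved mask id, the model dimensions, and the per-position masking loop are all assumptions of this example.

```python
# Minimal sketch of a cloze-style word-reconstruction objective (illustrative,
# not the paper's exact model). VOCAB_SIZE, MASK_ID, and the dimensions below
# are assumed values for the example.
import torch
import torch.nn as nn

VOCAB_SIZE = 10000   # assumed toy vocabulary size
MASK_ID = 0          # assumed id reserved for the ablated ("masked") word
D_MODEL = 128

class ClozeTransformer(nn.Module):
    def __init__(self, vocab_size=VOCAB_SIZE, d_model=D_MODEL, nhead=4, num_layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead,
                                           dim_feedforward=4 * d_model,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers)
        self.out = nn.Linear(d_model, vocab_size)  # scores for the ablated word

    def forward(self, token_ids):
        h = self.encoder(self.embed(token_ids))
        return self.out(h)  # (batch, seq_len, vocab_size) logits

def cloze_loss(model, token_ids):
    """Ablate each position in turn and predict it from the rest of the text."""
    batch, seq_len = token_ids.shape
    losses = []
    for pos in range(seq_len):
        corrupted = token_ids.clone()
        corrupted[:, pos] = MASK_ID              # remove the word at this position
        logits = model(corrupted)[:, pos, :]     # prediction for the ablated slot
        losses.append(nn.functional.cross_entropy(logits, token_ids[:, pos]))
    return torch.stack(losses).mean()

if __name__ == "__main__":
    model = ClozeTransformer()
    tokens = torch.randint(1, VOCAB_SIZE, (2, 16))  # toy batch of token ids
    loss = cloze_loss(model, tokens)
    loss.backward()
    print(f"cloze loss: {loss.item():.3f}")
```

In practice, masking every position one at a time is expensive; BERT-style implementations mask a random subset of positions per batch instead, which this loop only mimics in spirit.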
