
PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation



Abstract

Self-supervised pre-training, such as BERT (Devlin et al., 2018), MASS (Song et al., 2019) and BART (Lewis et al., 2019), has emerged as a powerful technique for natural language understanding and generation. Existing pre-training techniques employ autoencoding and/or autoregressive objectives to train Transformer-based models by recovering original word tokens from corrupted text with some masked tokens. The training goals of existing techniques are often inconsistent with the goals of many language generation tasks, such as generative question answering and conversational response generation, which produce new text given context. This work presents PALM, a novel scheme that jointly pre-trains an autoencoding and autoregressive language model on a large unlabeled corpus, specifically designed for generating new text conditioned on context. The new scheme alleviates the mismatch introduced by the existing denoising scheme between pre-training and fine-tuning, where generation is more than reconstructing the original text. An extensive set of experiments shows that PALM achieves new state-of-the-art results on a variety of language generation benchmarks covering generative question answering (Rank 1 on the official MARCO leaderboard), abstractive summarization on CNN/DailyMail as well as Gigaword, question generation on SQuAD, and conversational response generation on Cornell Movie Dialogues.
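To make the joint objective concrete, below is a minimal Python sketch of how a context-conditioned pre-training example might be assembled from raw text: the encoder sees a corrupted context and is trained to recover it (autoencoding), while the decoder is trained to generate the continuation as genuinely new text (autoregressive). The 80/20 context split, the 15% masking rate, and all names here are illustrative assumptions, not details stated in the abstract.

```python
import random

MASK_TOKEN = "[MASK]"  # hypothetical mask symbol


def make_palm_style_example(tokens, context_frac=0.8, mask_prob=0.15, rng=None):
    """Build one context-conditioned pre-training example (illustrative sketch).

    context_frac and mask_prob are assumed values, not taken from the paper's
    abstract.
    """
    rng = rng or random.Random(0)
    cut = max(1, int(len(tokens) * context_frac))
    context, continuation = tokens[:cut], tokens[cut:]

    # Autoencoding side: corrupt the context with masked tokens; the encoder
    # learns to recover the original tokens (a BERT-style denoising objective).
    corrupted = [MASK_TOKEN if rng.random() < mask_prob else t for t in context]

    # Autoregressive side: conditioned on the encoder's reading of the context,
    # the decoder generates the continuation -- new text rather than a
    # reconstruction of the corrupted input.
    return {
        "encoder_input": corrupted,
        "encoder_labels": context,
        "decoder_target": continuation,
    }


if __name__ == "__main__":
    doc = "self supervised pre training learns from unlabeled text at scale".split()
    example = make_palm_style_example(doc)
    print(example["encoder_input"])   # masked context fed to the encoder
    print(example["decoder_target"])  # continuation the decoder must generate
```

Training the decoder on the held-out continuation, rather than on reconstructing the corrupted input, is what aligns the pre-training signal with downstream tasks such as question answering and summarization, where the output is new text conditioned on the input rather than a copy of it.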
