
Mask-Predict: Parallel Decoding of Conditional Masked Language Models



Abstract

Most machine translation systems generate text autoregressively from left to right. Instead, we use a masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a partially masked target translation. This approach allows for efficient iterative decoding, where we first predict all of the target words non-autoregressively, and then repeatedly mask out and regenerate the subset of words that the model is least confident about. By applying this strategy for a constant number of iterations, our model improves state-of-the-art performance levels for non-autoregressive and parallel decoding translation models by over 4 BLEU on average. It is also able to reach within about 1 BLEU point of a typical left-to-right transformer model, while decoding significantly faster.
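The decoding loop described above can be sketched in a few lines of Python. This is a minimal illustration, not the paper's implementation: `toy_cmlm` is a stand-in for a trained conditional masked language model, with fabricated confidences so the sketch runs on its own; the linear mask-size schedule follows the "constant number of iterations" strategy from the abstract.

```python
MASK = "<mask>"

def toy_cmlm(source_tokens, masked_target):
    """Hypothetical stand-in for a conditional masked LM: returns a
    (token, confidence) prediction for every target position. A real
    CMLM would condition on the source and the unmasked target words;
    here we fake both the tokens and the confidences."""
    preds = []
    context = sum(tok != MASK for tok in masked_target)  # unmasked count
    for i in range(len(masked_target)):
        word = source_tokens[i % len(source_tokens)]      # dummy token
        conf = min(0.99, 0.5 + 0.1 * context + 0.01 * i)  # fake confidence
        preds.append((word, conf))
    return preds

def mask_predict(source_tokens, target_len, iterations=4):
    # Iteration 0: the target is fully masked, so every word is
    # predicted in parallel (the non-autoregressive first pass).
    target = [MASK] * target_len
    preds = toy_cmlm(source_tokens, target)
    target = [w for w, _ in preds]
    confs = [c for _, c in preds]

    for t in range(1, iterations):
        # Linearly shrinking mask size: re-mask the n least-confident
        # tokens, where n decays toward 0 over the fixed iteration budget.
        n = int(target_len * (iterations - t) / iterations)
        if n == 0:
            break
        worst = sorted(range(target_len), key=lambda i: confs[i])[:n]
        masked = list(target)
        for i in worst:
            masked[i] = MASK
        preds = toy_cmlm(source_tokens, masked)
        # Regenerate only the re-masked positions; keep the rest fixed.
        for i in worst:
            target[i], confs[i] = preds[i]
    return target
```

With a fixed iteration count, total decoding cost is a constant number of parallel model calls regardless of target length, which is where the speedup over left-to-right decoding comes from.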
