Conference on Computational Natural Language Learning

Cloze Distillation: Improving Neural Language Models with Human Next-Word Predictions

Abstract

Contemporary autoregressive language models (LMs) trained purely on corpus data have been shown to capture numerous features of human incremental processing. However, past work has also suggested dissociations between corpus probabilities and human next-word predictions. Here we evaluate several state-of-the-art language models for their match to human next-word predictions and to reading time behavior from eye movements. We then propose a novel method for distilling the linguistic information implicit in human linguistic predictions into pre-trained LMs: Cloze Distillation. We apply this method to a baseline neural LM and show potential improvement in reading time prediction and generalization to held-out human cloze data.
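
The abstract does not spell out the training objective. As a rough illustration, Cloze Distillation can be read as a knowledge-distillation-style objective in which the empirical distribution of human cloze completions serves as the teacher for the pre-trained LM. The sketch below is a minimal PyTorch rendering under that assumption; the function name cloze_distillation_loss, the interpolation weight alpha, and the exact mixing scheme are hypothetical and not taken from the paper.

```python
import torch
import torch.nn.functional as F

def cloze_distillation_loss(lm_logits, corpus_targets, cloze_dists, alpha=0.5):
    """Hypothetical cloze-distillation objective: interpolate the standard
    corpus cross-entropy with a KL term that pulls the LM's next-word
    distribution toward the empirical human cloze distribution.

    lm_logits:      (batch, vocab) model logits at the prediction positions
    corpus_targets: (batch,)       indices of the attested next words
    cloze_dists:    (batch, vocab) normalized counts of human cloze guesses
    alpha:          hypothetical interpolation weight
    """
    log_probs = F.log_softmax(lm_logits, dim=-1)
    corpus_ce = F.nll_loss(log_probs, corpus_targets)                 # standard LM loss
    cloze_kl = F.kl_div(log_probs, cloze_dists, reduction="batchmean")  # distillation term
    return alpha * corpus_ce + (1.0 - alpha) * cloze_kl

# Toy usage with a 5-word vocabulary and a batch of 2 prediction positions.
logits = torch.randn(2, 5)
targets = torch.tensor([1, 3])
cloze = torch.tensor([[0.1, 0.6, 0.1, 0.1, 0.1],
                      [0.0, 0.2, 0.0, 0.7, 0.1]])
loss = cloze_distillation_loss(logits, targets, cloze)
```

With alpha = 1.0 this reduces to ordinary corpus training, so the weight controls how strongly human predictions reshape the model's next-word distribution.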
