【24h】

Deep Learning in Automated Essay Scoring

机译:自动作文评分中的深度学习

获取原文

摘要

This paper explores the application of deep learning in automated essay scoring (AES). It uses the essay dataset #8 from the Automated Student Assessment Prize competition, hosted by the Kaggle platform, and a state-of-the-art Suite of Automatic Linguistic Analysis Tools (SALAT) to extract 1,463 writing features. A non-linear regressor deep neural network is trained to predict holistic scores on a scale of 10-60. This study shows that deep learning holds the promise to improve significantly the accuracy of AES systems, but that the current dataset and most essay datasets fall short of providing them with enough expertise (hand-graded essays) to exploit that potential. After the tuning of different sets of hyperparameters, the results show that the levels of agreement, as measured by the quadratic weighted kappa metric, obtained on the training, validation, and testing sets are 0.84, 0.63, and 0.58, respectively, while an ensemble (bagging) produced a kappa value of 0.80 on the testing set. Finally, this paper upholds that more than 1,000 hand-graded essays per writing construct would be necessary to adequately train the predictive student models on automated essay scoring, provided that all score categories are equally or fairly represented in the sample dataset.
机译:本文探讨了深度学习在自动作文评分(AES)中的应用。它使用Kaggle平台主办的“自动学生评估奖”比赛的论文数据集#8和最先进的自动语言分析工具(SALAT)套件来提取1,463个写作特征。训练了非线性回归器深度神经网络以预测10-60的整体得分。这项研究表明,深度学习有望显着提高AES系统的准确性,但是当前数据集和大多数论文数据集不足以为他们提供足够的专业知识(手工评分论文)以发掘这一潜力。对不同的超参数集进行调整后,结果表明,在训练,验证和测试集上获得的按二次加权kappa度量测得的一致性水平分别为0.84、0.63和0.58, (装袋)在测试集上得出的Kappa值为0.80。最后,本文认为,如果样本数据集中所有评分类别均等或公平地代表,则要在自动作文评分上充分地训练学生的预测模型,每个写作结构就需要超过1,000篇手工评分的作文。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号