A Study on Performance Sensitivity to Data Sparsity for Automated Essay Scoring

机译：自动化论文评分对数据稀疏性的性能敏感性研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automated essay scoring (AES) attempts to rate essays automatically using machine learning and natural language processing techniques, hoping to dramatically reduce the manual efforts involved. Given a target prompt and a set of essays (for the target prompt) to rate, established AES algorithms are mostly prompt-dependent, thereby heavily relying on labeled essays for the particular target prompt as training data, making the availability and the completeness of the labeled essays essential for an AES model to perform. In aware of this, this paper sets out to investigate the impact of data sparsity on the effectiveness of several state-of-the-art AES models. Specifically, on the publicly available ASAP dataset, the effectiveness of different AES algorithms is compared relative to different levels of data completeness, which are simulated with random sampling. To this end, we show that the classical RankSVM and KNN models are more robust to the data sparsity, compared with the end-to-end deep neural network models, but the latter leads to better performance after being trained on sufficient data.

机译：自动化论文评分（AES）尝试使用机器学习和自然语言处理技术对论文进行自动评分，以期显着减少所涉及的人工工作。给定目标提示和一组要评估的论文（针对目标提示），已建立的AES算法主要与提示相关，因此严重依赖于特定目标提示的标记论文作为训练数据，从而使目标提示的可用性和完整性成为可能。标记的论文对于AES模型的执行至关重要。有鉴于此，本文着手研究数据稀疏性对几种最新AES模型有效性的影响。具体来说，在可公开获得的ASAP数据集上，相对于数据完整性的不同级别，比较了不同AES算法的有效性，并使用随机抽样对其进行了仿真。为此，我们表明，与端到端的深度神经网络模型相比，经典的RankSVM和KNN模型对数据稀疏性更强健，但是在对足够的数据进行训练后，后者会带来更好的性能。

著录项

来源
《International conference on knowledge science, engineering and management》|2018年|104-116|共13页
会议地点
作者
Yanhua Ran; Ben He; Jungang Xu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Automated essay scoring; Data sparsity; Deep neural network;

机译：自动作文评分;数据稀疏;深度神经网络;

相似文献

外文文献
中文文献
专利

1. Appraising the scoring performance of automated essay scoring systemsSome additional considerations: Which essays? Which human raters? Which scores? [J] . Raczynski Kevin, Cohen Allan Applied Measurement in Education . 2018,第3期

机译：评估自动论文评分系统的评分绩效其他考虑因素：哪些论文？哪个人类评估者？哪个分数？
2. Imbalanced Learning Techniques for Improving the Performance of Statistical Models in Automated Essay Scoring [J] . Aluizio Haendchen Filho, Fernando Concatto, Jonathan Nau, Procedia Computer Science . 2019,第11期

机译：用于提高自动论文评分中统计模型性能的不平衡学习技术
3. Imbalanced Learning Techniques for Improving the Performance of Statistical Models in Automated Essay Scoring [J] . Aluizio Haendchen Filho, Fernando Concatto, Jonathan Nau, Procedia Computer Science . 2019,第1期

机译：用于提高自动论文评分中统计模型性能的不平衡学习技术
4. A Study on Performance Sensitivity to Data Sparsity for Automated Essay Scoring [C] . Yanhua Ran, Ben He, Jungang Xu International Conference on Knowledge Science, Engineering and Management . 2018

机译：自动论文评分数据稀疏性能敏感性研究
5. A Case Study to Assess the Effectiveness of an Automated Essay Scorer, Grade Eleven, as Measured by the Scores on the 2010 New York State English Regents. [D] . Chambless, Cynthia Cozart. 2011

机译：根据2010年纽约州英语摄政官的分数评估，评估自动写作评分器（十一年级）有效性的案例研究。
6. Performance of methods for meta-analysis of diagnostic test accuracy with few studies or sparse data [O] . Yemisi Takwoingi, Boliang Guo, Richard D Riley, -1

机译：通过很少的研究或稀疏数据进行诊断测试准确性的荟萃分析的方法的性能
7. A Study on the Application of Automated Essay Scoring in College English Writing Based on Pigai [O] . Wenxin Zhu 2019

机译：基于鸽子的大学英语写作中自动论文评分的应用研究
8. Business Case for Systems Engineering Study: Assessing Project Performance from Sparse Data. [R] . Elm, J. P. 2012

机译：系统工程研究的商业案例：从稀疏数据评估项目绩效。

A Study on Performance Sensitivity to Data Sparsity for Automated Essay Scoring

摘要

著录项

相似文献

相关主题

期刊订阅