A Study on Performance Sensitivity to Data Sparsity for Automated Essay Scoring

Abstract

Automated essay scoring (AES) aims to rate essays automatically using machine learning and natural language processing techniques, with the goal of substantially reducing the manual effort involved. Given a target prompt and a set of essays written for that prompt, established AES algorithms are mostly prompt-dependent: they rely heavily on labeled essays for the particular target prompt as training data, so the availability and completeness of those labeled essays are essential for an AES model to perform well. With this in mind, this paper investigates the impact of data sparsity on the effectiveness of several state-of-the-art AES models. Specifically, on the publicly available ASAP dataset, the effectiveness of different AES algorithms is compared across different levels of data completeness, which are simulated by random sampling. The results show that the classical RankSVM and KNN models are more robust to data sparsity than the end-to-end deep neural network models, but the latter achieve better performance once trained on sufficient data.
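The abstract says that levels of data completeness are simulated with random sampling. A minimal sketch of that idea, assuming the labeled essays are held in a list of (essay, score) pairs, could look like the following. The function names, the chosen fractions, and the toy data are hypothetical illustrations, not the authors' actual protocol or the ASAP data itself.

```python
import random


def subsample(essays, fraction, seed=0):
    """Return a random subset holding roughly `fraction` of the labeled essays."""
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    rng = random.Random(seed)  # fixed seed so the draw is reproducible
    k = max(1, round(len(essays) * fraction))  # keep at least one essay
    return rng.sample(essays, k)  # sampling without replacement


def sparsity_levels(essays, fractions=(0.1, 0.25, 0.5, 1.0), seed=0):
    """Build one training subset per completeness level (fractions are assumed)."""
    return {f: subsample(essays, f, seed) for f in fractions}


# Toy stand-in for labeled essays: (text, score) pairs, not real ASAP data.
labeled = [(f"essay-{i}", i % 5) for i in range(100)]
levels = sparsity_levels(labeled)
for fraction, subset in levels.items():
    print(fraction, len(subset))
```

Each subset would then be used to train a model, with all models evaluated on the same held-out essays, so that differences in effectiveness can be attributed to the amount of training data.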

Bibliographic record

Source
International Conference on Knowledge Science, Engineering and Management | 2018 | p. 526 | 13 pages
Authors
Yanhua Ran; Ben He; Jungang Xu
Original format: PDF
Chinese Library Classification: TP18-53
Keywords
Automated essay scoring; Data sparsity; Deep neural network

Similar literature

1. Appraising the scoring performance of automated essay scoring systems - Some additional considerations: Which essays? Which human raters? Which scores? [J]. Raczynski Kevin, Cohen Allan. Applied Measurement in Education. 2018, No. 3
2. Imbalanced Learning Techniques for Improving the Performance of Statistical Models in Automated Essay Scoring [J]. Aluizio Haendchen Filho, Fernando Concatto, Jonathan Nau. Procedia Computer Science. 2019, No. 11
3. Imbalanced Learning Techniques for Improving the Performance of Statistical Models in Automated Essay Scoring [J]. Aluizio Haendchen Filho, Fernando Concatto, Jonathan Nau. Procedia Computer Science. 2019, No. 1
4. A Study on Performance Sensitivity to Data Sparsity for Automated Essay Scoring [C]. Yanhua Ran, Ben He, Jungang Xu. International Conference on Knowledge Science, Engineering and Management. 2018
5. A Case Study to Assess the Effectiveness of an Automated Essay Scorer, Grade Eleven, as Measured by the Scores on the 2010 New York State English Regents [D]. Chambless, Cynthia Cozart. 2011
6. Performance of methods for meta-analysis of diagnostic test accuracy with few studies or sparse data [O]. Yemisi Takwoingi, Boliang Guo, Richard D Riley
7. A Study on the Application of Automated Essay Scoring in College English Writing Based on Pigai [O]. Wenxin Zhu. 2019
8. Business Case for Systems Engineering Study: Assessing Project Performance from Sparse Data [R]. Elm, J. P. 2012
