Essay Quality Signals as Weak Supervision for Source-based Essay Scoring

Abstract

Human essay grading is a laborious task that can consume much time and effort. Automated Essay Scoring (AES) has thus been proposed as a fast and effective solution to the problem of grading student writing at scale. However, because AES typically uses supervised machine learning, a human-graded essay corpus is still required to train the AES model. Unfortunately, such a graded corpus often does not exist, so creating a corpus for machine learning can also be a laborious task. This paper presents an investigation of replacing the use of human-labeled essay grades when training an AES system with two automatically available but weaker signals of essay quality: word count and topic distribution similarity. Experiments using two source-based essay scoring (evidence score) corpora show that while weak supervision does not yield a competitive result when training a neural source-based AES model, it can be used to successfully extract Topical Components (TCs) from a source text, which are required by a supervised feature-based AES model. In particular, results show that feature-based AES performance is comparable with either automatically or manually constructed TCs.
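The two weak signals named in the abstract can be illustrated with a small sketch. The abstract does not say how topic distribution similarity is computed, so this example substitutes a simple stand-in: cosine similarity between the unigram frequency distributions of the essay and the source text. The function names (`tokenize`, `word_count`, `term_distribution`, `cosine_similarity`, `weak_signals`) are hypothetical and are not taken from the paper.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def word_count(essay):
    """First weak signal: the number of word tokens in the essay."""
    return len(tokenize(essay))


def term_distribution(text):
    """Relative frequency of each token (an assumed stand-in for a topic distribution)."""
    counts = Counter(tokenize(text))
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}


def cosine_similarity(p, q):
    """Cosine similarity between two sparse distributions stored as dicts."""
    dot = sum(p[w] * q[w] for w in p.keys() & q.keys())
    norm_p = math.sqrt(sum(v * v for v in p.values()))
    norm_q = math.sqrt(sum(v * v for v in q.values()))
    if norm_p == 0.0 or norm_q == 0.0:
        return 0.0
    return dot / (norm_p * norm_q)


def weak_signals(essay, source_text):
    """Second weak signal: similarity between the essay and the source text.

    Assumption: the paper compares topic distributions; this sketch
    compares unigram distributions instead.
    """
    return {
        "word_count": word_count(essay),
        "similarity": cosine_similarity(
            term_distribution(essay), term_distribution(source_text)
        ),
    }
```

In a weak-supervision setup, either value could stand in for the human-assigned grade as a training target. How the signals are scaled or combined is not specified in the abstract.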

Bibliographic details

Source
Workshop on Innovative use of NLP for Building Educational Applications | 2021 | pp. 85-96 | 12 pages
Authors
Haoran Zhang; Diane Litman
Original format
PDF
Date indexed
2022-08-26 13:58:14

Similar literature

1. Predictive Modeling of Rater Behavior: Implications for Quality Assurance in Essay Scoring [J]. Bejar Isaac I., Li Chen, McCaffrey Daniel. Applied Measurement in Education, 2020, No. 3.
2. Incorporating learning characteristics into automatic essay scoring models: What individual differences and linguistic features tell us about writing quality [J]. Scott Crossley, Laura K Allen, Erica L Snow. Journal of Educational Data Mining, 2016, No. 2.
3. Automated Essay Scoring by Capturing Relative Writing Quality [J]. Hongbo Chen, Jungang Xu, Ben He. The Computer Journal, 2014, No. 9.
4. Features selection of high quality essays in automated essay scoring system [C]. Wang Mingtao, Tan Yongmei, Li Chao. 2011 International Conference on Electrical and Control Engineering, 2011.
5. Essays in trade, foreign aid and investment. Essay 1: Motivation for bilateral aid allocation: Altruism or trade benefits. Essay 2: Does institutional quality matter for investment and capital mobility? Evidence from sub-Saharan Africa. Essay 3: Do donors care about declining trade revenue from liberalization? An analysis of aid allocation (with Subhayu Bandyopadhyay) [D]. Younas, Javed. 2007.
6. Medical Essays. Essay First. On the Effects of Intestinal Irritation. Essay Second. On Some Effects of Loss of Blood. Essay Third. On Exhaustion and Sinking from Various Causes [O]. 1826.
7. Automated Topical Component Extraction Using Neural Network Attention Scores from Source-based Essay Scoring [O]. Haoran Zhang, Diane Litman. 2020.
