首页> 外文会议>Workshop on Innovative use of NLP for Building Educational Applications >Essay Quality Signals as Weak Supervision for Source-based Essay Scoring
【24h】

Essay Quality Signals as Weak Supervision for Source-based Essay Scoring

机译:散文质量信号作为基于源的文章评分的弱势监督

获取原文

摘要

Human essay grading is a laborious task that can consume much time and effort. Automated Essay Scoring (AES) has thus been proposed as a fast and effective solution to the problem of grading student writing at scale. However, because AES typically uses supervised machine learning, a human-graded essay corpus is still required to train the AES model. Unfortunately, such a graded corpus often does not exist, so creating a corpus for machine learning can also be a laborious task. This paper presents an investigation of replacing the use of human-labeled essay grades when training an AES system with two automatically available but weaker signals of essay quality: word count and topic distribution similarity. Experiments using two source-based essay scoring (evidence score) corpora show that while weak supervision does not yield a competitive result when training a neural source-based AES model, it can be used to successfully extract Topical Components (TCs) from a source text, which are required by a supervised feature-based AES model. In particular, results show that feature-based AES performance is comparable with either automatically or manually constructed TCs.
机译:人类的论文分级是一个艰苦的任务,可以消耗很多时间和努力。因此,已经提出了自动化论文评分(AES)作为在规模上评分学生写作的快速有效解决方案。然而,因为AES通常使用受监管机器学习,所以仍然需要一种人分级的论文语料库来训练AES模型。不幸的是,这种分级的语料库通常不存在,因此为机器学习创建一个语料库也可以是一个艰苦的任务。本文在训练AES系统时,对培训AES系统的培训时,呈现出人类标记的论文等级的使用调查,但是论文质量的信号较弱:字数和主题分布相似度。使用基于两个源的论文评分(证据评分)Corpora的实验表明,虽然在培训基于神经源的AES模型时弱监管不会产生竞争结果,但它可以用于从源文本中成功提取主题组件(TCS) ,由受监督的特征的AES模型需要。特别地,结果表明,基于特征的AES性能与自动或手动构造的TCS相当。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号