首页> 外文会议>AAAI Symposium >Trustworthy Automated Essay Scoring without Explicit Construct Validity

【24h】

Trustworthy Automated Essay Scoring without Explicit Construct Validity

机译：无标准的自动化论文评分没有明确构建有效性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automated essay scoring (AES) is a broadly used application of machine learning, with a long history of real-world use that impacts high-stakes decision-making for students. However, defensibility arguments in this space have typically been rooted in hand-crafted features and psychometrics research, which are a poor fit for recent advances in AI research and more formative classroom use of the technology. This paper proposes a framework for evaluating automated essay scoring models trained with more modern algorithms, used in a classroom setting; that framework is then applied to evaluate an existing product, Turnitin Revision Assistant.

机译：自动化论文评分（AES）是一种广泛应用的机器学习应用，具有悠久的真实用途历史，影响了学生的高赌注决策。然而，这种空间中的可退款性争论通常是植根于手工制作的特征和精神仪研究，这是近期AI研究和更具形成性课堂使用该技术的难度。本文提出了一种评估使用更多现代算法培训的自动论文评分模型的框架，用于教室设置; 然后应用该框架来评估现有的产品，转闭件修订版助理。

著录项

来源
《AAAI Symposium》|2018年|608p|共8页
会议地点
作者
Patti West-Smith; Stephanie Butler; Elijah Mayfield;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Evaluating the validity and applicability of automated essay scoring in two massive open online courses [J] . Erin Dawna Reilly, Rose Eleanore Stafford, Kyle Marie Williams, International Review of Research in Open and Distributed Learning . 2014,第5期

机译：在两个大规模的在线公开课程中评估自动作文评分的有效性和适用性
2. Stumping e-rater: challenging the validity of automated essay scoring [J] . Donald E. Power, Jill C. Burstein, Marthin Chodorow Computers in Human Behavior . 2002,第2期

机译：绊倒电子评分者：挑战自动作文评分的有效性
3. Using Automated Essay Scores as an Anchor When Equating Constructed Response Writing Tests [J] . Russell G. Almond International Journal of Testing: Official Journal of the International Test Commission . 2014,第1期

机译：等同于构建的反应写作测试时，使用自动作文成绩作为锚点
4. Trustworthy Automated Essay Scoring without Explicit Construct Validity [C] . Patti West-Smith, Stephanie Butler, Elijah Mayfield AAAI Symposium . 2018

机译：无标准的自动化论文评分没有明确构建有效性
5. Examining the Effects of Item Difficulty and Rating Method on Rating Reliability and Construct Validity of Constructed-Response and Essay Items on English Examinations [D] . Yao, Yuan. 2019

机译：检查项目难度和评级方法对英语考试中构建响应和论文项目的额定可靠性和构建有效性
6. Using Latent Semantic Analysis to Score Short Answer Constructed Responses: Automated Scoring of the Consequences Test [O] . Noelle LaVoie, James Parker, Peter J. Legree, 2020

机译：使用潜在语义分析来得分简短答案构建响应：后果测试的自动评分
7. Chapter 7. Construct Validity, Length, Score, and Time in Holistically Graded Writing Assessments: The Case against Automated Essay Scoring (AES) [O] . Les Perelman 2012

机译：第7章在全面评级写作评估中构建有效性，长度，分数和时间：对自动论文评分（AES）的情况

Trustworthy Automated Essay Scoring without Explicit Construct Validity

摘要

著录项

相似文献

相关主题

期刊订阅