首页> 外文期刊>Applied Measurement in Education >The effectiveness of machine score-ability ratings in predicting automated scoring performance
【24h】

The effectiveness of machine score-ability ratings in predicting automated scoring performance

机译:机器分数能力评级预测自动评分性能的有效性

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This study sought to provide a framework for evaluating machine score-ability of items using a new score-ability rating scale, and to determine the extent to which ratings were predictive of observed automated scoring performance. The study listed and described a set of factors that are thought to influence machine score-ability; these factors informed the score-ability rating applied by expert raters. Five Reading items, six Science items, and 10 Math items were examined. Experts in automated scoring served as reviewers, providing independent ratings of score-ability before engine calibration. Following the rating, engines were calibrated and their performances were evaluated using common industry criteria. Three derived criteria from the engine evaluations were computed: the score-ability value in the rating scale based on the empirical results, the number of industry evaluation criteria met by the engine, the approval status of the engine based on the number of criteria met. The results indicated that the score-ability ratings were moderately correlated with Science score-ability, the ratings were weakly correlated with Math score-ability, and were not correlated with Reading score-ability.
机译:本研究寻求使用新的评分能力评级规模评估机器分数能力的框架,并确定评级是预测观察到的自动评分性能的程度。该研究列出并描述了一组被认为影响机器得分能力的因素;这些因素通知专家评级申请的得分能力评级。检查五个阅读项目,六种科学项目和10个数学项目。自动评分专家曾担任审查人员,在发动机校准之前提供独立的评分能力评分。评分后,发动机被校准,并使用普通行业标准评估其性能。从发动机评估的三个来自发动机评估的标准:基于经验结果的评级规模中得分能力值,发动机遇到的行业评估标准数量,基于标准的标准数量达到了发动机的批准状态。结果表明,评分能力评级与科学评分能力适度相关,评级与数学评分能力弱相关,并且与阅读分数能力无关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号