首页> 外文期刊>Applied Measurement in Education >An Empirical Examination of the Impact of Group Discussion and Examinee Performance Information on Judgments Made in the Angoff Standard-Setting Procedure
【24h】

An Empirical Examination of the Impact of Group Discussion and Examinee Performance Information on Judgments Made in the Angoff Standard-Setting Procedure

机译:小组讨论和考生表现信息对Angoff标准制定程序中判断的影响的实证检验

获取原文
获取原文并翻译 | 示例
       

摘要

Numerous studies have compared the Angoff standard-setting procedure to other standard-setting methods, but relatively few studies have evaluated the procedure based on internal criteria. This study uses a generalizability theory framework to evaluate the stability of the estimated cut score. To provide a measure of internal consistency, this study also compares the estimated proportion correct scores resulting from the Angoff exercise to empirical conditional proportion correct scores. In this research, judges made independent estimates of the proportion of minimally proficient candidates that would be expected to answer each item correctly; they then discussed discrepancies and revised their estimates. Discussion of discrepancies decreased the variance components associated with the judge and judge-by-item effects, indicating increased agreement between judges, but it did not improve the correspondence between the judgments and the empirical proportion correct estimates. The judges then were given examinee performance information for a subset of the items. Subsequent ratings showed a substantial increase in correspondence with the empirical conditional proportion correct estimates. Particular attention is given to examining the discrepancy between the judgments and empirical proportion correct estimates as a function of item difficulty.
机译:众多研究已将Angoff标准制定程序与其他标准制定方法进行了比较,但相对较少的研究根据内部标准评估了该程序。这项研究使用概化理论框架来评估估计切割分数的稳定性。为了提供内部一致性的度量,本研究还将Angoff练习得出的估计比例正确分数与经验条件比例正确分数进行了比较。在这项研究中,法官对预计能正确回答每个项目的最低熟练程度候选人的比例进行了独立估计。然后,他们讨论了差异并修订了估算值。差异的讨论减少了与法官和逐项法官效应相关的方差成分,表明法官之间的一致性增加了,但并没有改善判决与经验比例正确估计之间的对应关系。然后,为法官提供一些子集的考生表现信息。随后的评级显示出与经验条件比例正确估计值相对应的大幅增加。特别要注意检查判断和实证比例正确估计之间的差异,这些差异是项目难度的函数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号