...
首页> 外文期刊>Applied Psychological Measurement >Deriving Stopping Rules for Multidimensional Computerized Adaptive Testing
【24h】

Deriving Stopping Rules for Multidimensional Computerized Adaptive Testing

机译:推导多维计算机化自适应测试的停止规则

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Multidimensional computerized adaptive testing (MCAT) is able to provide a vector of ability estimates for each examinee, which could be used to provide a more informative profile of an examinee's performance. The current literature on MCAT focuses on the fixed-length tests, which can generate less accurate results for those examinees whose abilities are quite different from the average difficulty level of the item bank when there are only a limited number of items in the item bank. Therefore, instead of stopping the test with a predetermined fixed test length, the authors use a more informative stopping criterion that is directly related to measurement accuracy. Specifically, this research derives four stopping rules that either quantify the measurement precision of the ability vector (i.e., minimum determinant rule [D-rule], minimum eigenvalue rule [E-rule], and maximum trace rule [T-rule]) or quantify the amount of available information carried by each item (i.e., maximum Kullback-Leibler divergence rule [K-rule]). The simulation results showed that all four stopping rules successfully terminated the test when the mean squared error of ability estimation is within a desired range, regardless of examinees' true abilities. It was found that when using the D-, E-, or T-rule, examinees with extreme abilities tended to have tests that were twice as long as the tests received by examinees with moderate abilities. However, the test length difference with K-rule is not very dramatic, indicating that K-rule may not be very sensitive to measurement precision. In all cases, the cutoff value for each stopping rule needs to be adjusted on a case-by-case basis to find an optimal solution.
机译:多维计算机自适应测试(MCAT)能够为每个应试者提供能力估计的向量,该能力估计值可用于提供有关应试者表现的更多信息。当前关于MCAT的文献集中在固定长度的测试上,当那些项目库中的项目数量有限时,对于那些能力与项目库的平均难度水平有很大差异的应试者来说,生成的结果可能不太准确。因此,作者不是使用预定的固定测试长度来停止测试,而是使用了一种更具信息量的停止标准,该标准与测量精度直接相关。具体来说,这项研究得出了四个停止规则,这些规则可以量化能力矢量的测量精度(即最小行列式规则[D-规则],最小特征值规则[E-规则]和最大跟踪规则[T-规则])或量化每个项目携带的可用信息量(即,最大Kullback-Leibler散度规则[K规则])。仿真结果表明,当能力估计的均方误差在期望范围内时,无论考生的真实能力如何,所有四个停止规则均成功终止了测试。结果发现,使用D规则,E规则或T规则时,具有极端能力的考生往往所进行的考试的时间是中等能力的考生所接受考试时间的两倍。但是,与K规则的测试长度差异不是很明显,这表明K规则可能对测量精度不是很敏感。在所有情况下,都需要根据具体情况调整每个停止规则的临界值,以找到最佳解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号