首页> 外文学位 >A comparison of kernel equating and traditional equipercentile equating methods and the parametric bootstrap methods for estimating standard errors in equipercentile equating.
【24h】

A comparison of kernel equating and traditional equipercentile equating methods and the parametric bootstrap methods for estimating standard errors in equipercentile equating.

机译:比较内核等值法和传统等分法等价法以及用于估计等分法等值法中标准误差的参数自举法。

获取原文
获取原文并翻译 | 示例

摘要

This study used simulation (a) to compare the kernel equating method to traditional equipercentile equating methods under the equivalent-groups (EG) design and the nonequivalent-groups with anchor test (NEAT) design and (b) to apply the parametric bootstrap method for estimating standard errors of equating. A two-parameter logistic item response theory (2-PL IRT) model was used to create population score distributions for different test sizes. Samples were drawn from the populations for different examinee sizes and the equating methods were evaluated using a criterion equating, which was an equipercentile equating function using the entire populations. Bias, standard errors, and root mean squared difference (RMSD) were used as measures to compare the methods to the criterion equating. The results show that KE and its traditional analogues are comparable under the EG design. However, under the NEAT design for which populations were created substantially different in this study, only the Post-stratification equating (PSE) with small or optimal bandwidth can produce results similar to those from the traditional frequency estimation method. Using the same data for equating under the EG design, the parametric bootstrap method was compared to the nonparametric bootstrap method and the analytic method. The results indicated that the parametric method resulted in more accurate estimates of standard errors of equating for all test and sample sizes considered than the other methods did.
机译:这项研究使用模拟(a)将等值组(EG)设计和非等价组的锚定检验(NEAT)设计下的内核等值化方法进行了比较,以及(b)将参数自举方法应用于估计等式的标准误差。使用两参数逻辑物流项目响应理论(2-PL IRT)模型来创建不同测试量的总体得分分布。从不同样本量的人群中抽取样本,并使用标准等式评估等值方法,该等式等于整个人群的等分位数等值函数。偏差,标准误和均方根差(RMSD)被用作将方法与标准等式进行比较的度量。结果表明,在EG设计下,KE及其传统类似物具有可比性。但是,在本研究中针对其创建种群的NEAT设计中,只有具有较小或最佳带宽的后分层等值(PSE)才能产生与传统频率估算方法相似的结果。使用相同的数据在EG设计下进行相等,将参数自举方法与非参数自举方法和解析方法进行了比较。结果表明,与其他方法相比,参量方法可得出对所有测试和样本量相等的标准误差的准确估计。

著录项

  • 作者

    Choi, Sae Il.;

  • 作者单位

    University of Illinois at Urbana-Champaign.;

  • 授予单位 University of Illinois at Urbana-Champaign.;
  • 学科 Education Tests and Measurements.;Statistics.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 126 p.
  • 总页数 126
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:38:17

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号