首页> 外文学位 >IRT-based automated test assembly: A sampling and stratification perspective.
【24h】

IRT-based automated test assembly: A sampling and stratification perspective.

机译:基于IRT的自动测试程序集:一个抽样和分层的观点。

获取原文
获取原文并翻译 | 示例

摘要

Each year the construction of several linear forms of an assessment is required for most large-scale assessments. Use of automated test assembly procedures to construct many parallel test forms greatly reduces the workload for test developers and ensures the quality of the tests. Existing automated test assembly methods include the heuristic approach, linear programming, network flow, and optimal design. All of these methods fall under the category of constrained combinatorial optimization (van der Linden, 1998).; The purpose of this study was to establish a new IRT-based automated test assembly method based on the sampling of test items from a specially stratified item bank such that the distribution of the items' parameter values mimicked that of the target test. Three such methods were introduced and developed in this study: the Cell Only Method, the Cell and Linear Programming Method (Cell & LP Method), and the Cell & Cube methods. Afterward, each of these methods was compared to the baseline, Minimax Model in Linear Programming Method. Six test forms of 40 items were assembled using each test assembly method. For the simulated item pool, a constraint of no more than 20% test overlap rate was added to both the linear programming component of the Cell & LP Method and the LP Method. Performance evaluation criteria included mean square deviation (MSD), form-to-form overlap rate, and test information function.; For tests assembled from the real item pool, the Cell Only Method proved to be superior to the other methods in terms of hitting target test information curves and providing lower MSDs. The Cell & Cube Method yielded the smallest test overlap rates. For tests constructed from the simulated 3000-item pool, the Cell & LP Method yielded the smallest MSDs. All three new methods yielded relatively smaller mean square deviation than the Minimax Model of the Linear Programming Method. Even when a test overlap rate constraint was added to the Minimax Model of the Linear Programming Method, the average test overlap rate was still higher than the three new methods. Overall, the Cell & Cube Method was recommended for its simplicity and item pool use.
机译:每年,大多数大型评估都需要构建几种线性形式的评估。使用自动化测试组装过程来构建许多并行测试表单,大大减少了测试开发人员的工作量并确保了测试质量。现有的自动测试组装方法包括启发式方法,线性编程,网络流和最佳设计。所有这些方法都属于约束组合优化的范畴(van der Linden,1998)。这项研究的目的是建立一个基于IRT的自动测试组装新方法,该方法基于对来自特定分层项目库的测试项目进行采样,以使项目的参数值的分布模仿目标测试的参数值。在此研究中引入和开发了三种此类方法:仅单元方法,单元和线性编程方法(Cell&LP方法)以及Cell&Cube方法。然后,将这些方法中的每一个都与基线(线性编程方法中的Minimax模型)进行比较。使用每种测试组装方法组装了40个项目的六个测试表单。对于模拟项目库,将不超过20%的测试重叠率约束添加到Cell&LP方法和LP方法的线性编程组件中。性能评估标准包括均方差(MSD),表格间重叠率和测试信息功能。对于从实际项目库中组装的测试,仅单元格方法在击中目标测试信息曲线并提供较低的MSD方面被证明优于其他方法。 Cell&Cube方法产生最小的测试重叠率。对于从模拟的3000个项目池构建的测试,Cell&LP方法产生的MSD最小。与线性规划方法的Minimax模型相比,所有这三种新方法均产生了相对较小的均方差。即使将测试重叠率约束添加到线性规划方法的Minimax模型中,平均测试重叠率仍高于三种新方法。总体而言,建议使用“单元格和多维数据集”方法,因为它简单易用且可以使用项目池。

著录项

  • 作者

    Chen, Pei-Hua.;

  • 作者单位

    The University of Texas at Austin.;

  • 授予单位 The University of Texas at Austin.;
  • 学科 Education Tests and Measurements.; Psychology Psychometrics.; Operations Research.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 123 p.
  • 总页数 123
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 教育;心理学研究方法;运筹学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号