IRT-based automated test assembly: A sampling and stratification perspective.

Abstract

Most large-scale assessments require several linear forms to be constructed each year. Using automated test assembly procedures to construct many parallel test forms greatly reduces the workload for test developers and ensures the quality of the tests. Existing automated test assembly methods include the heuristic approach, linear programming, network flow, and optimal design. All of these methods fall under the category of constrained combinatorial optimization (van der Linden, 1998).

The purpose of this study was to establish a new IRT-based automated test assembly method that samples test items from a specially stratified item bank so that the distribution of the items' parameter values mimics that of the target test. Three such methods were introduced and developed in this study: the Cell Only Method, the Cell and Linear Programming Method (Cell & LP Method), and the Cell & Cube Method. Each of these methods was then compared to the baseline, the Minimax Model in the Linear Programming Method (LP Method). Six test forms of 40 items were assembled using each test assembly method. For the simulated item pool, a constraint of no more than 20% test overlap rate was added to both the linear programming component of the Cell & LP Method and the LP Method. Performance evaluation criteria included mean square deviation (MSD), form-to-form overlap rate, and test information function.

For tests assembled from the real item pool, the Cell Only Method proved superior to the other methods in hitting the target test information curves and in providing lower MSDs. The Cell & Cube Method yielded the smallest test overlap rates. For tests constructed from the simulated 3000-item pool, the Cell & LP Method yielded the smallest MSDs. All three new methods yielded smaller MSDs than the Minimax Model of the LP Method. Even when a test overlap rate constraint was added to the Minimax Model of the LP Method, its average test overlap rate was still higher than those of the three new methods. Overall, the Cell & Cube Method was recommended for its simplicity and its use of the item pool.

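Bibliographic record

Author: Chen, Pei-Hua
Author affiliation: The University of Texas at Austin
Degree-granting institution: The University of Texas at Austin
Subject: Education Tests and Measurements; Psychology Psychometrics; Operations Research
Degree: Ph.D.
Year: 2005
Pages: 123 p.
Original format: PDF
Language: English
Chinese Library Classification: Education; Psychological research methods; Operations research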

Similar literature

1. Item Selection for the Development of Parallel Forms From an IRT-Based Seed Test Using a Sampling and Classification Approach [J]. Chen P.-H., Chang H.-H., Wu H. Educational and Psychological Measurement. 2012, No. 6
2. Role of invasive and noninvasive testing in risk stratification of sudden cardiac death in children and young adults: an electrophysiologic perspective [J]. Attari M, Dhala A. Pediatric Clinics of North America. 2004, No. 5
3. Automated detection of dual p16/Ki67 nuclear immunoreactivity in liquid-based Pap tests for improved cervical cancer risk stratification [J]. Arkadiusz Gertych, Anika O Joseph, Ann E Walts. Annals of Biomedical Engineering: The Journal of the Biomedical Engineering Society. 2012, No. 5
4. Fatigue Testing Method of Test Coupon and Structurally Equivalent Samples of Carbon Fiber Reinforced Polymer for Gas Turbine Engine Parts and Assemblies [C]. M. Nikhamkin, N. Sazhenkov, D. Samodurov. International Conference on Industrial Engineering. 2018
5. Comparisons between classical test theory and item response theory in automated assembly of alternate test forms [D]. Lin, Chuan-Ju. 2001
6. Automated detection of dual p16/Ki67 nuclear immunoreactivity in liquid-based Pap tests for improved cervical cancer risk stratification [O]. Arkadiusz Gertych, Anika O. Joseph, Ann E. Walts
7. Item Selection for the Development of Parallel Forms From an IRT-Based Seed Test Using a Sampling and Classification Approach [O]. Pei-hua Chen, Hua-hua Chang. 2016
8. WIPP (Waste Isolation Pilot Plant)/SRL in Situ Tests: Part 2, Pictorial History of MIIT (Materials Interface Interactions Tests) and Final MIIT Matrices, Assemblies, and Sample Listings [R]. Wicks, G. G., Weinle, M. E., Molecke, M. A. 1987