
An empirical examination of the impact of item parameters on IRT information functions in mixed format tests.



Abstract

Item response theory (IRT), also referred to as "modern test theory", offers many advantages over CTT-based methods in test development. In particular, IRT information functions make it possible to build a test with the desired precision of measurement at any point on a defined proficiency scale, provided a sufficient number of test items is available. This feature is extremely useful when test scores are used for decision making, for instance, deciding whether an examinee has attained a certain mastery level. Computerized adaptive testing (CAT) is one of many applications of IRT information functions in test construction.

The purposes of this study were as follows: (1) to examine the consequences of improving test quality through the addition of more discriminating items of different item formats; (2) to examine the effect of a test whose difficulty does not align with the ability level of the intended population; (3) to investigate the resulting changes in decision consistency and decision accuracy; and (4) to understand changes in expected information when test quality is either improved or degraded, using both empirical and simulated data.

Main findings from the study were as follows:
(1) Increasing the discriminating power of any type of item generally raised the level of information; however, it could sometimes have an adverse effect at the extreme ends of the ability continuum.
(2) It was important to have more items targeted at the population of interest; no matter how good the items were, they were of little value in test development if they were not targeted at the distribution of candidate ability or at the cutscores.
(3) Decision consistency (DC), the Kappa statistic, and decision accuracy (DA) increased with better-quality items.
(4) DC and Kappa were negatively affected when the difficulty of the test did not match the ability of the intended population; the effect was less severe, however, when the test was easier than needed.
(5) Tests with more high-quality items lowered the false positive (FP) and false negative (FN) rates at the cutscores.
(6) When test difficulty did not match the ability of the target examinees, both FP and FN rates generally increased.
(7) Polytomous items tended to yield more information than dichotomously scored items, regardless of an item's discrimination parameter and difficulty.
(8) The more score categories an item had, the more information it could provide.

Findings from this thesis should help testing agencies and practitioners better understand the impact of item parameters on item and test information functions. This understanding is crucial for improving item bank quality and, ultimately, for building better tests that provide more accurate proficiency classifications. At the same time, item writers should bear in mind that the item information function is merely a statistical tool for building a good test; other criteria, such as content balancing and content validity, should also be considered.
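To make the role of the item parameters concrete, the following minimal sketch (not taken from the dissertation) computes item information under the standard two-parameter logistic (2PL) model and Samejima's graded response model (GRM). It illustrates findings (1), (7), and (8): raising the discrimination parameter increases information near the item's difficulty but can reduce it at the extremes, and a polytomous item with more score categories spreads more information across the ability scale. All parameter values below are arbitrary and chosen for illustration only.

```python
import numpy as np

def info_2pl(theta, a, b):
    """Fisher information of a dichotomous 2PL item: I(theta) = a^2 * P * (1 - P)."""
    p = 1.0 / (1.0 + np.exp(-a * (np.asarray(theta, float) - b)))
    return a ** 2 * p * (1.0 - p)

def info_grm(theta, a, thresholds):
    """Fisher information of a polytomous graded response (Samejima) item.

    thresholds: ordered boundary difficulties b_1 < ... < b_m (m + 1 score categories).
    Uses I(theta) = sum_k P'_k(theta)^2 / P_k(theta), where category probabilities
    are differences of adjacent cumulative (boundary) response curves.
    """
    theta = np.asarray(theta, float)
    # Cumulative probabilities P*_k of scoring in category k or above,
    # padded with P*_0 = 1 and P*_{m+1} = 0.
    cum = [np.ones_like(theta)]
    cum += [1.0 / (1.0 + np.exp(-a * (theta - b))) for b in thresholds]
    cum += [np.zeros_like(theta)]
    info = np.zeros_like(theta)
    for k in range(len(cum) - 1):
        p_k = cum[k] - cum[k + 1]                         # category probability
        dp_k = a * (cum[k] * (1 - cum[k])                 # derivative of p_k w.r.t. theta
                    - cum[k + 1] * (1 - cum[k + 1]))
        info += dp_k ** 2 / np.clip(p_k, 1e-12, None)
    return info

theta = np.linspace(-3.0, 3.0, 7)
# Doubling discrimination quadruples peak information near b, but leaves the
# extremes of the ability continuum with less information.
print(info_2pl(theta, a=0.8, b=0.0))
print(info_2pl(theta, a=1.6, b=0.0))
# A 4-category polytomous item with the same discrimination spreads more
# information across the theta range than a single dichotomous item.
print(info_grm(theta, a=1.6, thresholds=[-1.0, 0.0, 1.0]))
```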
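The classification indices in findings (3) through (6) can likewise be illustrated with a small simulation-style helper. This is again an illustrative sketch rather than the author's code, assuming a single cutscore, known true mastery status (available only in simulation), and pass/fail decisions from two parallel test forms.

```python
import numpy as np

def decision_stats(true_master, form1_pass, form2_pass):
    """Illustrative decision-quality indices at a single cutscore.

    true_master : boolean array, true mastery status (known only in simulation).
    form1_pass, form2_pass : boolean arrays of pass/fail decisions from two
    parallel administrations of the test.
    """
    t = np.asarray(true_master, bool)
    f1 = np.asarray(form1_pass, bool)
    f2 = np.asarray(form2_pass, bool)

    dc = np.mean(f1 == f2)                        # decision consistency (DC)
    # Cohen's kappa: agreement between the two forms corrected for chance.
    p_chance = f1.mean() * f2.mean() + (1 - f1.mean()) * (1 - f2.mean())
    kappa = (dc - p_chance) / (1 - p_chance)

    da = np.mean(f1 == t)                         # decision accuracy (DA)
    fp = np.mean(f1 & ~t)                         # passed despite non-mastery (FP)
    fn = np.mean(~f1 & t)                         # failed despite mastery (FN)
    return {"DC": dc, "kappa": kappa, "DA": da, "FP": fp, "FN": fn}

rng = np.random.default_rng(0)
theta = rng.normal(size=5000)                     # simulated true abilities
cut = 0.0
true_master = theta >= cut
# Noisier measurement (i.e., lower test information) degrades DC, kappa, and DA
# and raises FP/FN rates at the cutscore.
obs1 = theta + rng.normal(scale=0.5, size=theta.size)
obs2 = theta + rng.normal(scale=0.5, size=theta.size)
print(decision_stats(true_master, obs1 >= cut, obs2 >= cut))
```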

Record details

  • Author

    Lam, Wai Yan Wendy

  • Author affiliation

    University of Massachusetts Amherst

  • Degree-granting institution: University of Massachusetts Amherst
  • Subject: Education Tests and Measurements
  • Degree: Ed.D.
  • Year: 2012
  • Pages: 198 p.
  • Total pages: 198
  • Format: PDF
  • Language: eng
  • Classification:
  • Keywords:

