首页> 外文期刊>Quality of Life Research >Evaluating measurement equivalence using the item response theory log-likelihood ratio (IRTLR) method to assess differential item functioning (DIF): applications (with illustrations) to measures of physical functioning ability and general distress
【24h】

Evaluating measurement equivalence using the item response theory log-likelihood ratio (IRTLR) method to assess differential item functioning (DIF): applications (with illustrations) to measures of physical functioning ability and general distress

机译:使用项目响应理论对数似然比(IRTLR)方法评估测量等效性,以评估差异项目功能(DIF):应用于身体功能和一般困扰的测量(带有插图)

获取原文
获取原文并翻译 | 示例
       

摘要

Background Methods based on item response theory (IRT) that can be used to examine differential item functioning (DIF) are illustrated. An IRT-based approach to the detection of DIF was applied to physical function and general distress item sets. DIF was examined with respect to gender, age and race. The method used for DIF detection was the item response theory log-likelihood ratio (IRTLR) approach. DIF magnitude was measured using the differences in the expected item scores, expressed as the unsigned probability differences, and calculated using the non-compensatory DIF index (NCDIF). Finally, impact was assessed using expected scale scores, expressed as group differences in the total test (measure) response functions. Methods The example for the illustration of the methods came from a study of 1,714 patients with cancer or HIV/AIDS. The measure contained 23 items measuring physical functioning ability and 15 items addressing general distress, scored in the positive direction. Results The substantive findings were of relatively small magnitude DIF. In total, six items showed relatively larger magnitude (expected item score differences greater than the cutoff) of DIF with respect to physical function across the three comparisons: “trouble with a long walk” (race), “vigorous activities” (race, age), “bending, kneeling stooping” (age), “lifting or carrying groceries” (race), “limited in hobbies, leisure” (age), “lack of energy” (race). None of the general distress items evidenced high magnitude DIF; although “worrying about dying” showed some DIF with respect to both age and race, after adjustment. Conclusions The fact that many physical function items showed DIF with respect to age, even after adjustment for multiple comparisons, indicates that the instrument may be performing differently for these groups. While the magnitude and impact of DIF at the item and scale level was minimal, caution should be exercised in the use of subsets of these items, as might occur with selection for clinical decisions or computerized adaptive testing. The issues of selection of anchor items, and of criteria for DIF detection, including the integration of significance and magnitude measures remain as issues requiring investigation. Further research is needed regarding the criteria and guidelines appropriate for DIF detection in the context of health-related items.
机译:说明了基于项目响应理论(IRT)的背景方法,可用于检查差异项目功能(DIF)。基于IRT的DIF检测方法已应用于身体功能和一般遇险项目集。对性别,性别,年龄和种族进行了检查。用于DIF检测的方法是项目响应理论对数似然比(IRTLR)方法。 DIF幅度是使用预期项目得分中的差异来衡量的,表示为无符号概率差异,并使用非补偿性DIF指数(NCDIF)进行计算。最后,使用预期的量表得分来评估影响,表示为总测试(度量)响应函数中的组差异。方法该方法的例子来自对1,714名癌症或HIV / AIDS患者的研究。该措施包括23项测量身体机能的项目和15项针对总体困扰的项目,在积极方向上进行了评分。结果实质性发现的DIF值相对较小。在以下三个比较中,总共有六个项目在身体功能方面表现出相对较大的DIF值(预期项目得分差异大于临界值):“麻烦行走”(种族),“剧烈活动”(种族,年龄) ),“弯曲,跪着弯腰”(年龄),“提起或携带食品杂货”(种族),“爱好有限,休闲”(年龄),“精力不足”(种族)。一般遇险项目均未显示出严重的DIF。尽管经过调整,“担心死亡”在年龄和种族方面都显示了一些DIF。结论许多生理功能项目即使在进行多次比较调整后仍显示出与年龄相关的DIF,这表明该仪器在这些人群中的表现可能有所不同。尽管DIF在项目和规模级别上的程度和影响极小,但应谨慎使用这些项目的子集,如选择临床决策或计算机化适应性测试时可能会发生这种情况。锚项目的选择和DIF检测标准的问题,包括重要程度和规模措施的整合,仍然是需要调查的问题。需要就与健康相关的项目中适合DIF检测的标准和指南进行进一步研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号