...
首页> 外文期刊>BMC Medical Research Methodology >Dealing with missing data in a multi-question depression scale: a comparison of imputation methods
【24h】

Dealing with missing data in a multi-question depression scale: a comparison of imputation methods

机译:在多问题抑郁量表中处理缺失数据:估算方法的比较

获取原文

摘要

Background Missing data present a challenge to many research projects. The problem is often pronounced in studies utilizing self-report scales, and literature addressing different strategies for dealing with missing data in such circumstances is scarce. The objective of this study was to compare six different imputation techniques for dealing with missing data in the Zung Self-reported Depression scale (SDS). Methods 1580 participants from a surgical outcomes study completed the SDS. The SDS is a 20 question scale that respondents complete by circling a value of 1 to 4 for each question. The sum of the responses is calculated and respondents are classified as exhibiting depressive symptoms when their total score is over 40. Missing values were simulated by randomly selecting questions whose values were then deleted (a missing completely at random simulation). Additionally, a missing at random and missing not at random simulation were completed. Six imputation methods were then considered; 1) multiple imputation, 2) single regression, 3) individual mean, 4) overall mean, 5) participant's preceding response, and 6) random selection of a value from 1 to 4. For each method, the imputed mean SDS score and standard deviation were compared to the population statistics. The Spearman correlation coefficient, percent misclassified and the Kappa statistic were also calculated. Results When 10% of values are missing, all the imputation methods except random selection produce Kappa statistics greater than 0.80 indicating 'near perfect' agreement. MI produces the most valid imputed values with a high Kappa statistic (0.89), although both single regression and individual mean imputation also produced favorable results. As the percent of missing information increased to 30%, or when unbalanced missing data were introduced, MI maintained a high Kappa statistic. The individual mean and single regression method produced Kappas in the 'substantial agreement' range (0.76 and 0.74 respectively). Conclusion Multiple imputation is the most accurate method for dealing with missing data in most of the missind data scenarios we assessed for the SDS. Imputing the individual's mean is also an appropriate and simple method for dealing with missing data that may be more interpretable to the majority of medical readers. Researchers should consider conducting methodological assessments such as this one when confronted with missing data. The optimal method should balance validity, ease of interpretability for readers, and analysis expertise of the research team.
机译:背景技术数据丢失对许多研究项目构成了挑战。在使用自我报告量表的研究中,这个问题通常很明显,而且在这种情况下,针对处理丢失数据的不同策略的文献很少。这项研究的目的是比较六种不同的归因技术来处理Zung自我报告的抑郁量表(SDS)中的缺失数据。方法来自手术结果研究的1580名参与者完成了SDS。 SDS是20个问题的量表,受访者通过将每个问题的值盘旋为1到4来完成。计算答案的总和,并且当受访者的总分超过40时,将他们分类为表现出抑郁症状。通过随机选择问题然后将其值删除来模拟缺失值(在随机模拟中完全缺失)。另外,完成了随机遗失和随机遗失的模拟。然后考虑了六种插补方法。 1)多重估算,2)单回归,3)个人平均值,4)总体平均值,5)参与者的先前反应以及6)从1到4的值的随机选择。每种方法的估算均值SDS得分和标准将偏差与人口统计数据进行比较。还计算了Spearman相关系数,错误分类的百分比和Kappa统计量。结果当缺少10%的值时,除随机选择外的所有插补方法均会产生大于0.80的Kappa统计数据,表明“接近完美”一致性。尽管单回归和个体均值插补也产生了良好的结果,但MI产生的最有效的插补值具有较高的Kappa统计量(0.89)。随着丢失信息的百分比增加到30%,或引入不平衡丢失数据时,MI保持了较高的Kappa统计数据。个体均值和单回归方法产生的Kappas在“基本一致”范围内(分别为0.76和0.74)。结论在我们为SDS评估的大多数误录数据场景中,多重插补是处理丢失数据的最准确方法。估算个人均值也是处理丢失数据的一种适当而简单的方法,对于大多数医学读者而言,这些数据可能更容易解释。研究人员在面对缺失数据时应考虑进行诸如此类的方法学评估。最佳方法应兼顾有效性,便于读者理解和研究团队的分析专业知识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号