Dealing with missing data in a multi-question depression scale: a comparison of imputation methods

Fiona M Shrive; Heather Stuart; Hude Quan; William A Ghali

首页> 外文期刊>BMC Medical Research Methodology >Dealing with missing data in a multi-question depression scale: a comparison of imputation methods

【24h】

Dealing with missing data in a multi-question depression scale: a comparison of imputation methods

机译：在多问题抑郁量表中处理缺失数据：估算方法的比较

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Background Missing data present a challenge to many research projects. The problem is often pronounced in studies utilizing self-report scales, and literature addressing different strategies for dealing with missing data in such circumstances is scarce. The objective of this study was to compare six different imputation techniques for dealing with missing data in the Zung Self-reported Depression scale (SDS). Methods 1580 participants from a surgical outcomes study completed the SDS. The SDS is a 20 question scale that respondents complete by circling a value of 1 to 4 for each question. The sum of the responses is calculated and respondents are classified as exhibiting depressive symptoms when their total score is over 40. Missing values were simulated by randomly selecting questions whose values were then deleted (a missing completely at random simulation). Additionally, a missing at random and missing not at random simulation were completed. Six imputation methods were then considered; 1) multiple imputation, 2) single regression, 3) individual mean, 4) overall mean, 5) participant's preceding response, and 6) random selection of a value from 1 to 4. For each method, the imputed mean SDS score and standard deviation were compared to the population statistics. The Spearman correlation coefficient, percent misclassified and the Kappa statistic were also calculated. Results When 10% of values are missing, all the imputation methods except random selection produce Kappa statistics greater than 0.80 indicating 'near perfect' agreement. MI produces the most valid imputed values with a high Kappa statistic (0.89), although both single regression and individual mean imputation also produced favorable results. As the percent of missing information increased to 30%, or when unbalanced missing data were introduced, MI maintained a high Kappa statistic. The individual mean and single regression method produced Kappas in the 'substantial agreement' range (0.76 and 0.74 respectively). Conclusion Multiple imputation is the most accurate method for dealing with missing data in most of the missind data scenarios we assessed for the SDS. Imputing the individual's mean is also an appropriate and simple method for dealing with missing data that may be more interpretable to the majority of medical readers. Researchers should consider conducting methodological assessments such as this one when confronted with missing data. The optimal method should balance validity, ease of interpretability for readers, and analysis expertise of the research team.

机译：背景技术数据丢失对许多研究项目构成了挑战。在使用自我报告量表的研究中，这个问题通常很明显，而且在这种情况下，针对处理丢失数据的不同策略的文献很少。这项研究的目的是比较六种不同的归因技术来处理Zung自我报告的抑郁量表（SDS）中的缺失数据。方法来自手术结果研究的1580名参与者完成了SDS。 SDS是20个问题的量表，受访者通过将每个问题的值盘旋为1到4来完成。计算答案的总和，并且当受访者的总分超过40时，将他们分类为表现出抑郁症状。通过随机选择问题然后将其值删除来模拟缺失值（在随机模拟中完全缺失）。另外，完成了随机遗失和随机遗失的模拟。然后考虑了六种插补方法。 1）多重估算，2）单回归，3）个人平均值，4）总体平均值，5）参与者的先前反应以及6）从1到4的值的随机选择。每种方法的估算均值SDS得分和标准将偏差与人口统计数据进行比较。还计算了Spearman相关系数，错误分类的百分比和Kappa统计量。结果当缺少10％的值时，除随机选择外的所有插补方法均会产生大于0.80的Kappa统计数据，表明“接近完美”一致性。尽管单回归和个体均值插补也产生了良好的结果，但MI产生的最有效的插补值具有较高的Kappa统计量（0.89）。随着丢失信息的百分比增加到30％，或引入不平衡丢失数据时，MI保持了较高的Kappa统计数据。个体均值和单回归方法产生的Kappas在“基本一致”范围内（分别为0.76和0.74）。结论在我们为SDS评估的大多数误录数据场景中，多重插补是处理丢失数据的最准确方法。估算个人均值也是处理丢失数据的一种适当而简单的方法，对于大多数医学读者而言，这些数据可能更容易解释。研究人员在面对缺失数据时应考虑进行诸如此类的方法学评估。最佳方法应兼顾有效性，便于读者理解和研究团队的分析专业知识。

著录项

来源
《BMC Medical Research Methodology 》 |2006年第1期| 共页
作者
Fiona M Shrive; Heather Stuart; Hude Quan; William A Ghali;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类医药、卫生 ;
关键词

相似文献

外文文献
中文文献
专利

1. Missing data methods for dealing with missing items in quality of life questionnaires. A comparison by simulation of personal mean score, full information maximum likelihood, multiple imputation, and hot deck techniques applied to the SF-36 in the French 2003 decennial health survey [J] . Hugo Peyre, Alain Leplège, Joël Coste Quality of Life Research . 2011 ,第2期

机译：用于处理生活质量问卷中缺失项目的缺失数据方法。通过对法国2003年十年健康调查中SF-36所使用的个人平均得分，最大信息的最大信息可能性，多次归因和热甲板技术的模拟进行比较
2. Missing data methods for dealing with missing items in quality of life questionnaires. A comparison by simulation of personal mean score, full information maximum likelihood, multiple imputation, and hot deck techniques applied to the SF-36 in the French 2003 decennial health survey. [J] . Peyre H, Leplege A, Coste J Quality of life research: An international journal of quality of life aspects of treatment, care and rehabilitation . 2011 ,第2期

机译：用于处理生活质量问卷中缺失项目的缺失数据方法。通过对法国2003十年期健康调查中SF-36所使用的个人平均得分，全部信息的最大可能性，多次归因和热甲板技术的模拟进行比较。
3. A comparison of various software tools for dealing with missing data via imputation [J] . Jose Cortinas Abrahantes, Cristina Sotto, Geert Molenberghs, Journal of statistical computation and simulation . 2011 ,第11a12期

机译：各种通过插补处理丢失数据的软件工具的比较
4. Missing Network Data A Comparison of Different Imputation Methods [C] . Robert W Krause, Mark Huisman, Christian Steglich, IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining;International Symposium on Foundations of Open Source Intelligence and Security Informatics;International Symposium on Foundations and Applications of Big Data Analytics;International Symposium on Network Enabled Health Informatics, Biomedicine and Bioinformatics . 2018

机译：缺少网络数据-不同插补方法的比较
5. Statistical and machine learning techniques for dealing with missing data in criminal justice: A simulation and comparison of missing data methods. [D] . Hill, Joshua. 2012

机译：统计和机器学习技术，用于处理刑事司法中的缺失数据：缺失数据方法的模拟和比较。
6. Dealing with missing data in a multi-question depression scale: a comparison of imputation methods [O] . Fiona M Shrive, Heather Stuart, Hude Quan, 2006

机译：在多问题抑郁量表中处理缺失数据：估算方法的比较
7. Dealing with missing data in a multi-question depression scale: a comparison of imputation methods [O] . Stuart Heather, Shrive Fiona M, Quan Hude, 2006

机译：在多问题抑郁量表中处理缺失数据：估算方法的比较

Dealing with missing data in a multi-question depression scale: a comparison of imputation methods

摘要

著录项

相似文献

相关主题

期刊订阅