首页> 外文学位 >Missing data in multivariate longitudinal studies: Comparing results from different missing data techniques using an empirical data set.
【24h】

Missing data in multivariate longitudinal studies: Comparing results from different missing data techniques using an empirical data set.

机译:多元纵向研究中的缺失数据:使用经验数据集比较不同缺失数据技术的结果。

获取原文
获取原文并翻译 | 示例

摘要

Problems of missing data arise in almost all developmental research. One common practice in solving the problem of missing data has been to delete those cases with missing data points. However, statisticians have shown that this type of missing data method is not adequate, since it can significantly decrease the sample size, thus reducing power, and provide results that are not representative for the population, hence introducing bias. Depending on the amount and causes of missingness, as well as the pattern of missing data, different approaches can be used. At this writing, the recommended missing data procedures are the maximum likelihood (ML) method and the multiple imputation (MI) method, particularly when dealing with multivariate and/or longitudinal datasets. However, most of the recommendations are based on simulation studies.; The goal of this dissertation was to provide empirical evidence of the performance of recommended missing data techniques as well as the traditional method of deleting cases when they are used on a data set from an empirical study that tries to find answers to real research questions. Three missing data techniques (i.e., listwise deletion, the ML method, and the MI method) were used with an empirical data set in order to evaluate the extent of differences in the results and, therefore, in the interpretations of results. In particular, the focus was on two sets of analyses: linear growth modeling and predicting a specific outcome from a set of prediction variables at previous points in time.; Data from the first three waves of the 4-H Study of Positive Youth Development were used for the analyses. The results showed that the three missing data techniques did not yield comparable results for research questions assessing different aspects of development (i.e., change over time or prediction effects). However, the results suggested that the listwise deletion method produces quite different results from the recommended missing data techniques, which was evident in large discrepancies in parameter estimates between the listwise deletion method and the recommended techniques, as well as substantially larger standard errors in the listwise deletion method. In addition, smaller sample size and loss of statistical power negatively affected results of the listwise deletion method. Nevertheless, the results also showed that in some cases the MI method produces different results from the ML method. One possible explanation of this difference might be the use of auxiliary variables with the MI method only.; Generalization of these conclusions is limited to those studies that have a large sample size with similar amounts of missing data for each variable, as was the case in the longitudinal sample of the 4-H Study. In addition, further studies should be explored in which the MI method is conducted with appropriate software designed specifically for imputing longitudinal data. Such analyses could allow researchers to determine the extent that results differ depending on the imputation model that is used.; The results of the present empirical analyses suggest that researchers should not use the listwise deletion method, but use both recommended missing data techniques (i.e., the ML and the MI method) in order to see if and how the results of their analyses change. In addition, a longitudinal researcher should not just depend on the missing data techniques to solve the problems of missing data; he or she should also take actions before data collection, such as creating a plan to develop an appropriate questionnaire and make an effort to increase the sample retention rate, in order to decrease attrition and missing data.
机译:几乎所有的开发研究都出现了数据丢失的问题。解决数据丢失问题的一种常见做法是删除那些数据点丢失的情况。但是,统计学家已经表明,这种类型的数据丢失方法是不足够的,因为它可以显着减小样本数量,从而降低功效,并提供不代表总体的结果,从而造成偏差。根据丢失的数量和原因以及丢失数据的模式,可以使用不同的方法。在撰写本文时,建议的缺失数据过程是最大似然(ML)方法和多重插补(MI)方法,尤其是在处理多元和/或纵向数据集时。但是,大多数建议是基于仿真研究的。本文的目的是为推荐的缺失数据技术的性能提供经验证据,以及当从经验研究的数据集中尝试删除案例时,传统的删除案例的方法试图找到真实研究问题的答案。三种缺失的数据技术(即按列表删除,ML方法和MI方法)与经验数据集一起使用,以便评估结果差异的程度,从而评估结果的解释。具体来说,重点是两套分析:线性增长建模和从先前时间点的一组预测变量预测特定结果。分析使用了来自4-H积极青少年发展研究的前三波的数据。结果表明,对于评估发展的不同方面(即随时间变化或预测效果)的研究问题,这三种缺失的数据技术无法产生可比的结果。但是,结果表明,按列表删除方法所产生的结果与建议的缺失数据技术截然不同,这在按列表删除方法和所建议的技术之间的参数估计存在较大差异以及按列表方法显着更大的标准误差中很明显删除方法。此外,较小的样本量和统计功效的丧失会对按列表删除方法的结果产生负面影响。但是,结果还表明,在某些情况下,MI方法与ML方法产生的结果不同。这种差异的一种可能解释可能是仅在MI方法中使用辅助变量。这些结论的概括仅限于样本量较大且每个变量缺少相似数据量的研究,如4-H研究的纵向样本中的情况。此外,应探索进一步的研究,其中使用专门为估算纵向数据而设计的适当软件进行MI方法。这种分析可以使研究人员确定结果的程度取决于所使用的归因模型。当前的经验分析结果表明,研究人员不应使用列表删除方法,而应同时使用推荐的缺失数据技术(即ML和MI方法),以查看其分析结果是否以及如何变化。另外,纵向研究者不应该仅仅依靠缺失数据技术来解决缺失数据的问题。他或她还应在数据收集之前采取行动,例如制定计划以制定适当的调查表,并努力提高样品保留率,以减少损耗和丢失数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号