...
首页> 外文期刊>Pharmacoepidemiology and drug safety >Issues in multiple imputation of missing data for large general practice clinical databases.
【24h】

Issues in multiple imputation of missing data for large general practice clinical databases.

机译:大型一般诊所临床数据库的多重缺失数据丢失数据的问题。

获取原文
获取原文并翻译 | 示例
           

摘要

PURPOSE: Missing data are a substantial problem in clinical databases. This paper aims to examine patterns of missing data in a primary care database, compare this to nationally representative datasets and explore the use of multiple imputation (MI) for these data. METHODS: The patterns and extent of missing health indicators in a UK primary care database (THIN) were quantified using 488 384 patients aged 16 or over in their first year after registration with a GP from 354 General Practices. MI models were developed and the resulting data compared to that from nationally representative datasets (14 142 participants aged 16 or over from the Health Survey for England 2006 (HSE) and 4 252 men from the British Regional Heart Study (BRHS)). RESULTS: Between 22% (smoking) and 38% (height) of health indicator data were missing in newly registered patients, 2004-2006. Distributions of height, weight and blood pressure were comparable to HSE and BRHS, but alcohol and smoking were not. After MI the percentage of smokers and non-drinkers was higher in THIN than the comparison datasets, while the percentage of ex-smokers and heavy drinkers was lower. Height, weight and blood pressure remained similar to the comparison datasets. CONCLUSIONS: Given available data, the results are consistent with smoking and alcohol data missing not at random whereas height, weight and blood pressure missing at random. Further research is required on suitable imputation methods for smoking and alcohol in such databases.
机译:目的:缺少数据是临床数据库中的实质性问题。本文旨在检验初级保健数据库中缺失数据的模式,将其与国家代表性数据集进行比较,并探索这些数据的多重估算(MI)。方法:使用来自354名一般行为的GP后,使用488 384名初级保健数据库(薄)在英国初级保健数据库(薄)中缺失健康指标的模式和程度。与来自国家代表数据集相比,开发了MI模型,由此产生的数据(14人142人16岁或16岁或来自英国英国(HSE)和4 252名男子从英国地区心脏研究(BRHS))的参与者)。结果:2004 - 2006年新注册患者缺少22%(吸烟)和38%(高度)的健康指标数据。高度,体重和血压分布与HSE和BRHS相当,但酒精和吸烟不是。在MI之后,吸烟者和非饮酒者的百分比薄于比较数据集,而前吸烟者和沉重饮酒者的百分比较低。高度,重量和血压仍然与比较数据集相似。结论:给予可用数据,结果与无随机的吸烟和酒精数据一致,而随机缺少高度,体重和血压。在此类数据库中对吸烟和酒精的合适估算方法需要进一步研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号