...
首页> 外文期刊>Annals of Human Genetics >On the performance of multiple imputation based on chained equations in tackling missing data of the African α3.7-globin deletion in a malaria association study
【24h】

On the performance of multiple imputation based on chained equations in tackling missing data of the African α3.7-globin deletion in a malaria association study

机译:疟疾关联研究中基于链式方程的多重插补在处理非洲α3.7-珠蛋白缺失缺失数据中的性能

获取原文
获取原文并翻译 | 示例

摘要

Multiple imputation based on chained equations (MICE) is an alternative missing genotype method that can use genetic and nongenetic auxiliary data to inform the imputation process. Previously, MICE was successfully tested on strongly linked genetic data. We have now tested it on data of the HBA2 gene which, by the experimental design used in a malaria association study in Tanzania, shows a high missing data percentage and is weakly linked with the remaining genetic markers in the data set. We constructed different imputation models and studied their performance under different missing data conditions. Overall, MICE failed to accurately predict the true genotypes. However, using the best imputation model for the data, we obtained unbiased estimates for the genetic effects, and association signals of the HBA2 gene on malaria positivity. When the whole data set was analyzed with the same imputation model, the association signal increased from 0.80 to 2.70 before and after imputation, respectively. Conversely, postimputation estimates for the genetic effects remained the same in relation to the complete case analysis but showed increased precision. We argue that these postimputation estimates are reasonably unbiased, as a result of a good study design based on matching key socio-environmental factors.
机译:基于链式方程(MICE)的多重插补是一种可选的缺失基因型方法,可以使用遗传和非遗传辅助数据来告知插补过程。以前,MICE已在紧密关联的遗传数据上成功进行了测试。我们现在已经在HBA2基因的数据上对其进行了测试,通过坦桑尼亚疟疾关联研究中使用的实验设计,该数据显示出较高的缺失数据百分比,并且与数据集中的其余遗传标记存在弱关联。我们构建了不同的归因模型,并研究了在不同缺失数据条件下的性能。总体而言,MICE无法准确预测真正的基因型。但是,使用最佳插补模型获取数据,我们获得了遗传效应的无偏估计,以及HBA2基因对疟疾阳性的关联信号。当使用相同的插补模型分析整个数据集时,关联信号在插补之前和之后分别从0.80增加到2.70。相反,与完整的病例分析相比,对遗传效应的切除后估计仍保持不变,但显示出更高的精确度。我们认为,由于基于匹配的关键社会环境因素进行了良好的研究设计,因此这些注入后的估计值是合理无偏的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号