首页> 美国卫生研究院文献>other >VARIABLE SELECTION AND PREDICTION WITH INCOMPLETE HIGH-DIMENSIONAL DATA
【2h】

VARIABLE SELECTION AND PREDICTION WITH INCOMPLETE HIGH-DIMENSIONAL DATA

机译:不完整的高维数据的变量选择和预测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We propose a Multiple Imputation Random Lasso (mirl) method to select important variables and to predict the outcome for an epidemiological study of Eating and Activity in Teens. In this study 80% of individuals have at least one variable missing. Therefore, using variable selection methods developed for complete data after listwise deletion substantially reduces prediction power. Recent work on prediction models in the presence of incomplete data cannot adequately account for large numbers of variables with arbitrary missing patterns. We propose MIRL to combine penalized regression techniques with multiple imputation and stability selection. Extensive simulation studies are conducted to compare MIRL with several alternatives. MIRL outperforms other methods in high-dimensional scenarios in terms of both reduced prediction error and improved variable selection performance, and it has greater advantage when the correlation among variables is high and missing proportion is high. MIRL is shown to have improved performance when comparing with other applicable methods when applied to the study of Eating and Activity in Teens for the boys and girls separately, and to a subgroup of low social economic status (ses) Asian boys who are at high risk of developing obesity.
机译:我们提出了一种多重插补随机套索(mirl)方法,以选择重要变量并预测青少年饮食和活动的流行病学研究结果。在这项研究中,有80%的个人至少缺少一个变量。因此,使用针对列表删除后为完整数据开发的变量选择方法会大大降低预测能力。在存在不完整数据的情况下,有关预测模型的最新工作不能充分考虑大量具有任意缺失模式的变量。我们提出MIRL将惩罚回归技术与多重插补和稳定性选择相结合。进行了广泛的仿真研究,以将MIRL与几种替代方案进行比较。在减少预测误差和提高变量选择性能方面,MIRL在高维方案中均优于其他方法,并且当变量之间的相关性较高且缺失比例较高时,它具有更大的优势。与其他适用的方法分别用于男孩和女孩的青少年饮食和活动研究以及社会经济地位低(ses)的高风险亚洲男孩组相比,MIRL具有更好的表现肥胖。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号