首页> 外文期刊>Journal of Clinical Epidemiology >Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality.
【24h】

Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality.

机译:用于逻辑回归的自动变量选择方法产生了不稳定的模型,用于预测急性心肌梗死的死亡率。

获取原文
获取原文并翻译 | 示例
           

摘要

OBJECTIVES: Automated variable selection methods are frequently used to determine the independent predictors of an outcome. The objective of this study was to determine the reproducibility of logistic regression models developed using automated variable selection methods. STUDY DESIGN AND SETTING: An initial set of 29 candidate variables were considered for predicting mortality after acute myocardial infarction (AMI). We drew 1,000 bootstrap samples from a dataset consisting of 4,911 patients admitted to hospital with an AMI. Using each bootstrap sample, logistic regression models predicting 30-day mortality were obtained using backward elimination, forward selection, and stepwise selection. The agreement between the different model selection methods and the agreement across the 1,000 bootstrap samples were compared. RESULTS: Using 1,000 bootstrap samples, backward elimination identified 940 unique models for predicting mortality. Similar results were obtained for forward and stepwise selection. Three variables were identified as independent predictors of mortality among all bootstrap samples. Over half the candidate prognostic variables were identified as independent predictors in less than half of the bootstrap samples. CONCLUSION: Automated variable selection methods result in models that are unstable and not reproducible. The variables selected as independent predictors are sensitive to random fluctuations in the data.
机译:目标:自动变量选择方法通常用于确定结果的独立预测因子。这项研究的目的是确定使用自动变量选择方法开发的逻辑回归模型的可重复性。研究设计和设置:初步考虑了29个候选变量,以预测急性心肌梗死(AMI)后的死亡率。我们从包含4,911名因AMI入院的患者的数据集中抽取了1,000个引导程序样本。使用每个自举样本,使用后向消除,前向选择和逐步选择来获得预测30天死亡率的逻辑回归模型。比较了不同模型选择方法之间的一致性和1000个引导程序样本之间的一致性。结果:使用1,000个引导程序样本,向后消除确定了940个用于预测死亡率的独特模型。对于正向和逐步选择,获得了相似的结果。在所有引导程序样本中,三个变量被确定为死亡率的独立预测因子。在不到一半的引导样本中,超过一半的候选预后变量被确定为独立的预测因子。结论:自动变量选择方法导致模型不稳定且不可重现。被选作独立预测变量的变量对数据的随机波动敏感。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号