首页> 美国卫生研究院文献>other >Finding Factors Influencing Risk: Comparing Variable Selection Methods Applied to Logistic Regression Models of Cases and Controls

【2h】

Finding Factors Influencing Risk: Comparing Variable Selection Methods Applied to Logistic Regression Models of Cases and Controls

机译：查找影响风险的因素：比较变量选择方法应用于案例和控制的Logistic回归模型

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

When modeling the risk of a disease, the very act of selecting the factors to include can heavily impact the results. This study compares the performance of several variable selection techniques applied to logistic regression. We performed realistic simulation studies to compare five methods of variable selection: (1) a confidence interval approach for significant coefficients (CI), (2) backward selection, (3) forward selection, (4) stepwise selection, and (5) Bayesian stochastic search variable selection (SSVS) using both informed and uniformed priors. We defined our simulated diseases mimicking odds ratios for cancer risk found in the literature for environmental factors, such as smoking; dietary risk factors, such as fiber; genetic risk factors such as XPD; and interactions. We modeled the distribution of our covariates, including correlation, after the reported empirical distributions of these risk factors. We also used a null data set to calibrate the priors of the Bayesian method and evaluate its sensitivity. Of the standard methods (95% CI, backward, forward and stepwise selection) the CI approach resulted in the highest average percent of correct associations and lowest average percent of incorrect associations. SSVS with an informed prior had higher average percent of correct associations and lower average percent of incorrect associations than did the CI approach. This study shows that Bayesian methods offer a way to use prior information to both increase power and decrease false-positive results when selecting factors to model complex disease risk.

机译：在对疾病风险进行建模时，选择要包括的因素的行为会严重影响结果。本研究比较了应用于逻辑回归的几种变量选择技术的性能。我们进行了现实的仿真研究，以比较五种变量选择方法：（1）有效系数（CI）的置信区间方法；（2）后向选择；（3）前向选择；（4）逐步选择；以及（5）贝叶斯方法使用知情和统一先验的随机搜索变量选择（SSVS）。我们定义了模拟疾病，以模仿文献中针对环境因素（例如吸烟）患癌症风险的比值比。饮食风险因素，例如纤维；遗传风险因素，例如XPD；和互动。在报告了这些风险因素的经验分布之后，我们对包括相关性在内的协变量分布进行了建模。我们还使用了一个空数据集来校准贝叶斯方法的先验并评估其灵敏度。在标准方法（95％CI，向后，向前和逐步选择）中，CI方法导致正确关联的平均百分比最高，而错误关联的平均百分比最低。具有先验知识的SSVS与CI方法相比，正确关联的平均百分比更高，而错误关联的平均百分比更低。这项研究表明，贝叶斯方法为选择复杂疾病风险的建模因素提供了一种利用先验信息来增加功效和减少假阳性结果的方法。

著录项

期刊名称 other
作者
Michael D. Swartz; Robert K. Yu; Sanjay Shete;
展开▼
作者单位

展开▼
年(卷),期 -1(27),29
年度 -1
页码 6158–6174
总页数 18
原文格式 PDF
正文语种
中图分类
关键词
Bayesian logistic regression Case-control analyses Logistic regression Prior calibration Variable selection;

机译：贝叶斯Logistic回归;案例分析;Logistic回归;事前校准;变量选择;

相似文献

外文文献
中文文献
专利

1. Finding factors influencing risk: Comparing Bayesian stochastic search and standard variable selection methods applied to logistic regression models of cases and controls. [J] . Swartz MD, Yu RK, Shete S Statistics in medicine . 2008,第29期

机译：查找影响风险的因素：比较贝叶斯随机搜索和标准变量选择方法，将其应用于病例和对照的逻辑回归模型。
2. Systematic Selection of Key Logistic Regression Variables for Risk Prediction Analyses: A Five-Factor Maximum Model [J] . Hewett Timothy E., Webster Kate E., Hurd Wendy J. Clinical journal of sport medicine: official journal of the Canadian Academy of Sport Medicine . 2019,第1期

机译：风险预测分析的关键逻辑回归变量的系统选择：五因素最大模型
3. Variable selection methods for multiple regressions influence the parsimony of risk prediction models for cardiac surgery [J] . Karim Md Nazmul, Epi M. Clin, Reid Christopher M., The Journal of Thoracic and Cardiovascular Surgery . 2017,第5期

机译：多元回归的可变选择方法会影响心脏手术风险预测模型的差异
4. Modeling of hypertension risk factors using local linear of additive nonparametric logistic regression [C] . E Ana, N Chamidah, P Andriani, International Conference on Research, Implementation, and Education Mathematics and Science . 2020

机译：应用局部线性添加剂非参数逻辑回归的高血压风险因素建模
5. Nonparametric regression as a general statistical modeling methodology: A Monte Carlo investigation of factors influencing statistical power and robust performance in the presence of moderator variables [D] . McLeod, Jeffrey Thomas. 1998

机译：非参数回归作为一般的统计建模方法：在主持人变量存在的情况下，对影响统计能力和鲁棒性能的因素进行蒙特卡洛研究
6. A simulation based method for assessing the statistical significance of logistic regression models after common variable selection procedures [O] . Tristan R. Grogan, David A. Elashoff -1

机译：评估公共变量选择程序后逻辑回归模型的统计显着性的基于仿真的方法
7. Comparation of logistic regression methods and discrete choice model in the selection of habitats Comparação dos métodos regressão logística e modelo de escolha discreta na seleção de habitats [O] . Sandra Vergara Cardozo, Bryan Frederick John Manly, Carlos Tadeu dos Santos Dias 2010

机译：栖息地选择中逻辑回归方法与离散选择模型的比较栖息地选择中逻辑回归方法与离散选择模型的比较

Finding Factors Influencing Risk: Comparing Variable Selection Methods Applied to Logistic Regression Models of Cases and Controls

摘要

著录项

相似文献

相关主题

期刊订阅