首页> 外文会议>Second joint biostatistics symposium (2012) >Variable selection and estimation in generalized linear models with the seamless Lo penalty
【24h】

Variable selection and estimation in generalized linear models with the seamless Lo penalty

机译:具有无缝Lo罚分的广义线性模型中的变量选择和估计

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose variable selection and estimation in generalized linear models using the seamless Lo (SELO) penalized likelihood approach. The SELO penalty is a smooth function that very closely resembles the discontinuous Lo penalty. We develop an efficient algorithm to fit the model, and show that the SELO-GLM procedure has the oracle property in the presence of a diverging number of variables. We propose a Bayesian Information Criterion (BIC) to select the tuning parameter. We show that under some regularity conditions, the proposed SELO-GLM/BIC procedure consistently selects the true model. We perform simulation studies to evaluate the finite sample performance of the proposed methods. Our simulation studies show that the proposed SELO-GLM procedure has a better finite sample performance than several existing methods, especially when the number of variables is large and the signals are weak. We apply the SELO-GLM to analyze a breast cancer genetic dataset to identify the SNPs that are associated with breast cancer risk.
机译:在本文中,我们提出了使用无缝Lo(SELO)惩罚似然方法的广义线性模型中的变量选择和估计。 SELO罚分是一个平滑函数,非常类似于不连续的Lo罚分。我们开发了一种适合模型的有效算法,并表明SELO-GLM过程在存在数量众多的变量的情况下具有oracle属性。我们提出一种贝叶斯信息准则(BIC)以选择调整参数。我们表明,在某些规律性条件下,建议的SELO-GLM / BIC程序始终选择真实的模型。我们进行仿真研究,以评估所提出方法的有限样本性能。我们的仿真研究表明,与几种现有方法相比,拟议的SELO-GLM程序具有更好的有限采样性能,尤其是在变量数量大且信号弱的情况下。我们应用SELO-GLM来分析乳腺癌遗传数据集,以识别与乳腺癌风险相关的SNP。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号