首页> 美国卫生研究院文献>Bioinformatics >Bayesian variable selection for binary outcomes in high-dimensional genomic studies using non-local priors
【2h】

Bayesian variable selection for binary outcomes in high-dimensional genomic studies using non-local priors

机译:使用非本地先验的高维基因组研究中用于二元结果的贝叶斯变量选择

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

>Motivation: The advent of new genomic technologies has resulted in the production of massive data sets. Analyses of these data require new statistical and computational methods. In this article, we propose one such method that is useful in selecting explanatory variables for prediction of a binary response. Although this problem has recently been addressed using penalized likelihood methods, we adopt a Bayesian approach that utilizes a mixture of non-local prior densities and point masses on the binary regression coefficient vectors.>Results: The resulting method, which we call iMOMLogit, provides improved performance in identifying true models and reducing estimation and prediction error in a number of simulation studies. More importantly, its application to several genomic datasets produces predictions that have high accuracy using far fewer explanatory variables than competing methods. We also describe a novel approach for setting prior hyperparameters by examining the total variation distance between the prior distributions on the regression parameters and the distribution of the maximum likelihood estimator under the null distribution. Finally, we describe a computational algorithm that can be used to implement iMOMLogit in ultrahigh-dimensional settings (p    n) and provide diagnostics to assess the probability that this algorithm has identified the highest posterior probability model.>Availability and implementation: Software to implement this method can be downloaded at: .>Contact: or >Supplementary information: are available at Bioinformatics online.
机译:>动机:新的基因组技术的出现导致了海量数据集的产生。这些数据的分析需要新的统计和计算方法。在本文中,我们提出了一种这样的方法,该方法可用于选择用于预测二进制响应的解释变量。尽管最近已使用惩罚似然法解决了该问题,但我们采用了贝叶斯方法,该方法在二元回归系数向量上利用了非局部先验密度和点质量的混合。>结果:结果方法,我们将其称为iMOMLogit,在许多仿真研究中提供了改进的性能,可用于识别真实模型并减少估计和预测误差。更重要的是,它在几个基因组数据集上的应用所产生的预测使用的解释变量比竞争方法少得多,因此具有很高的准确性。我们还描述了一种通过检查回归参数上的先验分布与零分布下最大似然估计量的分布之间的总变化距离来设置先验超参数的新颖方法。最后,我们描述了一种可用于在超高维设置(p> n)中实现iMOMLogit的计算算法,并提供了诊断方法以评估该算法识别出最高后验概率模型的可能性。>可用性和实现: 可以从以下位置下载实现此方法的软件:>联系方式:或>补充信息:可从Bioinformatics在线获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号