首页> 外文学位 >Bayesian model uncertainty and prior choice with applications to genetic association studies.
【24h】

Bayesian model uncertainty and prior choice with applications to genetic association studies.

机译:贝叶斯模型的不确定性和先验选择及其在遗传关联研究中的应用。

获取原文
获取原文并翻译 | 示例

摘要

The Bayesian approach to model selection allows for uncertainty in both model specific parameters and in the models themselves. Much of the recent Bayesian model uncertainty literature has focused on defining these prior distributions in an objective manner, providing conditions under which Bayes factors lead to the correct model selection, particularly in the situation where the number of variables, p, increases with the sample size, n. This is certainly the case in our area of motivation the biological application of genetic association studies involving single nucleotide polymorphisms. While the most common approach to this problem has been to apply a marginal test to all genetic markers, we employ analytical strategies that improve upon these marginal methods by modeling the outcome variable as a function of a multivariate genetic profile using Bayesian variable selection. In doing so, we perform variable selection on a large number of correlated covariates within studies involving modest sample sizes.In particular, we present an efficient Bayesian model search strategy that searches over the space of genetic markers and their genetic parametrization. The resulting method for Multilevel Inference of SNP Associations MISA, allows computation of multilevel posterior probabilities and Bayes factors at the global, gene and SNP level. We use simulated data sets to characterize MISA's statistical power, and show that MISA has higher power to detect association than standard procedures. Using data from the North Carolina Ovarian Cancer Study (NCOCS), MISA identifies variants that were not identified by standard methods and have been externally 'validated' in independent studies.In the context of Bayesian model uncertainty for problems involving a large number of correlated covariates we characterize commonly used prior distributions on the model space and investigate their implicit multiplicity correction properties first in the extreme case where the model includes an increasing number of redundant covariates and then under the case of full rank design matrices. We provide conditions on the asymptotic (in n and p) behavior of the model space prior required to achieve consistent selection of the global hypothesis of at least one associated variable in the analysis using global posterior probabilities (i.e. under 0-1 loss). In particular, under the assumption that the null model is true, we show that the commonly used uniform prior on the model space leads to inconsistent selection of the global hypothesis via global posterior probabilities (the posterior probability of at least one association goes to 1) when the rank of the design matrix is finite. In the full rank case, we also show inconsistency when p goes to infinity faster than n . Alternatively, we show that any model space prior such that the global prior odds of association increases at a rate slower than n results in consistent selection of the global hypothesis in terms of posterior probabilities.
机译:模型选择的贝叶斯方法允许模型特定参数和模型本身都存在不确定性。最近的许多贝叶斯模型不确定性文献都集中于客观地定义这些先验分布,提供了条件,在这些条件下贝叶斯因子导致正确的模型选择,尤其是在变量数量p随着样本数量增加而增加的情况下,n。在我们的研究领域中,涉及单核苷酸多态性的遗传关联研究的生物学应用肯定是这种情况。尽管解决此问题的最常见方法是对所有遗传标记应用边际检验,但我们采用分析策略,通过使用贝叶斯变量选择将结果变量建模为多元遗传图谱的函数,从而改善了这些边际方法。为此,我们在涉及样本量适中的研究中对大量相关的协变量进行了变量选择,特别是,我们提出了一种有效的贝叶斯模型搜索策略,可以在遗传标记及其遗传参数化的空间中进行搜索。用于SNP关联MISA的多层次推断的最终方法允许在全局,基因和SNP层次上计算多层次后验概率和贝叶斯因子。我们使用模拟数据集来表征MISA的统计能力,并表明MISA具有比标准程序更高的检测关联的能力。利用来自北卡罗来纳州卵巢癌研究(NCOCS)的数据,MISA可以识别出标准方法未鉴定出并已在独立研究中进行外部``验证''的变体。在涉及大量相关协变量的贝叶斯模型不确定性背景下我们在模型空间上刻画了常用的先验分布,并在极端情况下研究了它们的隐式多重校正特性,在极端情况下,模型包括数量不断增加的冗余协变量,然后在满秩设计矩阵的情况下。我们提供了使用全局后验概率(即0-1损失下)来实现分析中至少一个相关变量的全局假设的一致选择之前,需要模型空间的渐近(n和p)行为的条件。特别是,在假设无效模型为真的情况下,我们表明模型空间上常用的统一先验会导致通过全局后验概率(至少一个关联的后验概率变为1)对全局假设的不一致选择。当设计矩阵的秩是有限的时。在满秩情况下,当p到达n的速度快于n时,我们也显示出不一致。或者,我们表明,任何模型空间先验使得关联的全局先验几率以比n慢的速率增加,从而可以根据后验概率对全局假设进行一致的选择。

著录项

  • 作者

    Wilson, Melanie A.;

  • 作者单位

    Duke University.;

  • 授予单位 Duke University.;
  • 学科 Applied Mathematics.Statistics.Biology Genetics.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 112 p.
  • 总页数 112
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号