首页> 外文期刊>Genetics: A Periodical Record of Investigations Bearing on Heredity and Variation >Genome-Wide Association Analyses Based on Broadly Different Specifications for Prior Distributions, Genomic Windows, and Estimation Methods
【24h】

Genome-Wide Association Analyses Based on Broadly Different Specifications for Prior Distributions, Genomic Windows, and Estimation Methods

机译:基于广泛不同的先验分布,基因组窗口和估计方法的规范的全基因组关联分析

获取原文
       

摘要

A currently popular strategy (EMMAX) for genome-wide association (GWA) analysis infers association for the specific marker of interest by treating its effect as fixed while treating all other marker effects as classical Gaussian random effects. It may be more statistically coherent to specify all markers as sharing the same prior distribution, whether that distribution is Gaussian, heavy-tailed (BayesA), or has variable selection specifications based on a mixture of, say, two Gaussian distributions [stochastic search and variable selection (SSVS)]. Furthermore, all such GWA inference should be formally based on posterior probabilities or test statistics as we present here, rather than merely being based on point estimates. We compared these three broad categories of priors within a simulation study to investigate the effects of different degrees of skewness for quantitative trait loci (QTL) effects and numbers of QTL using 43,266 SNP marker genotypes from 922 Duroc–Pietrain F2-cross pigs. Genomic regions were based either on single SNP associations, on nonoverlapping windows of various fixed sizes (0.5–3 Mb), or on adaptively determined windows that cluster the genome into blocks based on linkage disequilibrium. We found that SSVS and BayesA lead to the best receiver operating curve properties in almost all cases. We also evaluated approximate maximum a posteriori (MAP) approaches to BayesA and SSVS as potential computationally feasible alternatives; however, MAP inferences were not promising, particularly due to their sensitivity to starting values. We determined that it is advantageous to use variable selection specifications based on adaptively constructed genomic window lengths for GWA studies.
机译:当前流行的全基因组关联(GWA)分析策略(EMMAX)通过将特定效应标记为固定效应,同时将所有其他标记效应视为经典高斯随机效应来推断特定目标标记的关联。将所有标记指定为共享相同的先验分布可能在统计上更加一致,无论该分布是高斯分布,重尾分布(BayesA)还是基于例如两个高斯分布的混合变量选择规范[随机搜索和变量选择(SSVS)]。此外,所有此类GWA推论都应正式基于后验概率或测试统计数据,而不仅仅是基于点估计。我们在一项模拟研究中比较了这三大先验类别,以研究使用922头杜洛克-皮特兰F2杂交猪的43,266个SNP标记基因型对不同程度的偏斜度对数量性状基因座(QTL)效应和QTL数量的影响。基因组区域基于单个SNP关联,各种固定大小(0.5-3 Mb)的不重叠窗口或基于连锁不平衡将基因组聚集成块的自适应确定窗口。我们发现,几乎在所有情况下,SSVS和BayesA都能带来最佳的接收机工作曲线特性。我们还评估了BayesA和SSVS的近似最大后验(MAP)方法作为潜在的计算可行替代方案;但是,MAP推断没有希望,特别是由于它们对初始值敏感。我们确定使用基于自适应构建的基因组窗口长度的变量选择规范进行GWA研究是有利的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号