...
首页> 外文期刊>PLoS Genetics >A Pleiotropy-Informed Bayesian False Discovery Rate Adapted to a Shared Control Design Finds New Disease Associations From GWAS Summary Statistics
【24h】

A Pleiotropy-Informed Bayesian False Discovery Rate Adapted to a Shared Control Design Finds New Disease Associations From GWAS Summary Statistics

机译:适应共享控制设计的多向信息性贝叶斯错误发现率从GWAS摘要统计中发现新的疾病关联

获取原文
           

摘要

Genome-wide association studies (GWAS) have been successful in identifying single nucleotide polymorphisms (SNPs) associated with many traits and diseases. However, at existing sample sizes, these variants explain only part of the estimated heritability. Leverage of GWAS results from related phenotypes may improve detection without the need for larger datasets. The Bayesian conditional false discovery rate (cFDR) constitutes an upper bound on the expected false discovery rate (FDR) across a set of SNPs whose p values for two diseases are both less than two disease-specific thresholds. Calculation of the cFDR requires only summary statistics and have several advantages over traditional GWAS analysis. However, existing methods require distinct control samples between studies. Here, we extend the technique to allow for some or all controls to be shared, increasing applicability. Several different SNP sets can be defined with the same cFDR value, and we show that the expected FDR across the union of these sets may exceed expected FDR in any single set. We describe a procedure to establish an upper bound for the expected FDR among the union of such sets of SNPs. We apply our technique to pairwise analysis of p values from ten autoimmune diseases with variable sharing of controls, enabling discovery of 59 SNP-disease associations which do not reach GWAS significance after genomic control in individual datasets. Most of the SNPs we highlight have previously been confirmed using replication studies or larger GWAS, a useful validation of our technique; we report eight SNP-disease associations across five diseases not previously declared. Our technique extends and strengthens the previous algorithm, and establishes robust limits on the expected FDR. This approach can improve SNP detection in GWAS, and give insight into shared aetiology between phenotypically related conditions. Author Summary Many diseases have a significant hereditary component, only part of which has been explained by analysis of genome-wide association studies (GWAS). Shared aetiology, treatment protocols, and overlapping results from existing GWAS suggest similarities in genetic susceptibility between related diseases, which may be exploited to detect more disease-associated SNPs without the need for further data. We extend an existing method for detecting SNPs associated with a given disease by conditioning on association with another disease. Our extension allows GWAS for the two conditions to share control samples, enabling larger overall control groups and application to the common case when GWAS for related diseases pool control samples. We demonstrate that our technique limits the expected overall false discovery rate at a threshold dependent on the two diseases. We apply our technique to genotype data from ten immune mediated diseases. Overall pleiotropy between phenotypes is demonstrated graphically. We are able to declare several SNPs significant at a genome-wide level whilst controlling at a lower false-discovery rate than would be possible using a conventional approach, identifying eight previously unknown disease associations. This technique can improve SNP detection in GWAS by re-analysing existing data, and gives insight into the shared genetic bases of autoimmune diseases.
机译:全基因组关联研究(GWAS)已成功鉴定出与许多性状和疾病相关的单核苷酸多态性(SNP)。但是,在现有的样本量下,这些变体仅解释了部分估计的遗传力。利用相关表型的GWAS结果可以改善检测效果,而无需使用更大的数据集。贝叶斯条件错误发现率(cFDR)构成了一组SNP的预期错误发现率(FDR)的上限,SNP的两种疾病的p值均小于两个疾病特异性阈值。 cFDR的计算仅需要汇总统计信息,并且比传统的GWAS分析具有多个优势。但是,现有方法在研究之间需要不同的对照样品。在这里,我们扩展了该技术以允许共享某些或所有控件,从而提高了适用性。可以使用相同的cFDR值定义几个不同的SNP集,并且我们证明,这些集合并集的预期FDR可能会超出任何单个集合中的预期FDR。我们描述了一种程序,以在此类SNP集的联合之间建立预期FDR的上限。我们将我们的技术用于对十种自身免疫性疾病的p值进行成对分析,并进行控制变量共享,从而发现了59个SNP疾病关联,这些关联在单个数据集中进行基因组控制后未达到GWAS的重要意义。我们强调的大多数SNP先前已通过复制研究或更大的GWAS进行了确认,这是对我们技术的有用验证。我们报告了以前未宣布的五种疾病中的八个SNP疾病关联。我们的技术扩展并增强了先前的算法,并为预期的FDR建立了稳健的限制。这种方法可以改善GWAS中的SNP检测,并深入了解表型相关条件之间的共同病因。作者总结许多疾病具有重要的遗传成分,仅通过全基因组关联研究(GWAS)的分析即可解释其中的一部分。共有的病因学,治疗方案和现有GWAS的重叠结果表明,相关疾病之间的遗传易感性相似,可以将其用于检测更多与疾病相关的SNP,而无需进一步的数据。我们通过限制与另一种疾病的关联,扩展了一种用于检测与特定疾病关联的SNP的现有方法。我们的扩展允许GWAS在两个条件下共享控制样本,从而实现更大的整体控制组,并将其应用于相关疾病的GWAS合并控制样本的常见情况。我们证明了我们的技术将预期的总体错误发现率限制在取决于两种疾病的阈值上。我们将我们的技术应用于来自十种免疫介导疾病的基因型数据。表型之间的总体多效性通过图形显示。我们能够在全基因组水平上宣布几个重要的SNP,同时以比传统方法更低的错误发现率来控制,从而发现了8个以前未知的疾病关联。该技术可以通过重新分析现有数据来改善GWAS中的SNP检测,并深入了解自身免疫性疾病的共有遗传基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号