首页> 美国卫生研究院文献>other >Imputation of Variants from the 1000 Genomes Project Modestly Improves Known Associations and Can Identify Low-frequency Variant - Phenotype Associations Undetected by HapMap Based Imputation
【2h】

Imputation of Variants from the 1000 Genomes Project Modestly Improves Known Associations and Can Identify Low-frequency Variant - Phenotype Associations Undetected by HapMap Based Imputation

机译:来自1000个基因组项目的变异的插补适度地改善了已知的关联并且可以识别基于HapMap的插补无法检测到的低频变异-表型关联

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Genome-wide association (GWA) studies have been limited by the reliance on common variants present on microarrays or imputable from the HapMap Project data. More recently, the completion of the 1000 Genomes Project has provided variant and haplotype information for several million variants derived from sequencing over 1,000 individuals. To help understand the extent to which more variants (including low frequency (1% ≤ MAF <5%) and rare variants (<1%)) can enhance previously identified associations and identify novel loci, we selected 93 quantitative circulating factors where data was available from the InCHIANTI population study. These phenotypes included cytokines, binding proteins, hormones, vitamins and ions. We selected these phenotypes because many have known strong genetic associations and are potentially important to help understand disease processes. We performed a genome-wide scan for these 93 phenotypes in InCHIANTI. We identified 21 signals and 33 signals that reached P<5×10−8 based on HapMap and 1000 Genomes imputation, respectively, and 9 and 11 that reached a stricter, likely conservative, threshold of P<5×10−11 respectively. Imputation of 1000 Genomes genotype data modestly improved the strength of known associations. Of 20 associations detected at P<5×10−8 in both analyses (17 of which represent well replicated signals in the NHGRI catalogue), six were captured by the same index SNP, five were nominally more strongly associated in 1000 Genomes imputed data and one was nominally more strongly associated in HapMap imputed data. We also detected an association between a low frequency variant and phenotype that was previously missed by HapMap based imputation approaches. An association between rs112635299 and alpha-1 globulin near the SERPINA gene represented the known association between rs28929474 (MAF = 0.007) and alpha1-antitrypsin that predisposes to emphysema (P = 2.5×10−12). Our data provide important proof of principle that 1000 Genomes imputation will detect novel, low frequency-large effect associations.
机译:全基因组关联研究(GWA)受限于微阵列上存在的常见变体或可从HapMap项目数据推算而来。最近,“ 1000个基因组计划”的完成提供了来自1000多个个体测序的数百万个变体的变体和单倍型信息。为了帮助理解更多变体(包括低频(1%≤MAF <5%)和稀有变体(<1%))在多大程度上可以增强先前识别的关联并识别新的基因座,我们选择了93个定量循环因子,其中数据为可从InCHIANTI人口研究中获得。这些表型包括细胞因子,结合蛋白,激素,维生素和离子。我们选择这些表型是因为许多表型具有很强的遗传关联,并且对于帮助理解疾病过程具有潜在的重要性。我们对InCHIANTI中的这93个表型进行了全基因组扫描。我们分别基于HapMap和1000个基因组估算确定了21个信号和33个达到P <5×10 -8 的信号,以及分别达到9个和11个达到更严格的保守阈值P <5的信号×10 −11 。估算1000个基因组基因型数据可适度提高已知关联的强度。在两次分析中,在P <5×10 −8 下检测到20个关联(其中17个代表NHGRI目录中复制良好的信号),其中六个被相同的索引SNP捕获,五个在名义上更强与1000个基因组估算数据相关联,而与HapMap估算数据相关性更强。我们还检测了低频变体和表型之间的关联,而这种关联以前被基于HapMap的插补方法所遗漏。 rs112635299和SERPINA基因附近的alpha-1球蛋白之间的关联代表了已知的rs28929474(MAF = 0.007)和易患肺气肿的alpha1-抗胰蛋白酶之间的关联(P = 2.5×10 -12 )。我们的数据提供了重要的原理证明,即1000个基因组估算将检测新颖的,低频大效应关联。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号