首页> 外文期刊>Genetic epidemiology. >Gene- or region-based analysis of genome-wide association studies.
【24h】

Gene- or region-based analysis of genome-wide association studies.

机译:基于基因或区域的全基因组关联研究分析。

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

With rapid advances in genotyping technologies in recent years and the growing number of available markers, genome-wide association studies are emerging as promising approaches for the study of complex diseases and traits. However, there are several challenges with analysis and interpretation of such data. First, there is a massive multiple testing problem, due to the large number of markers that need to be analyzed, leading to an increased risk of false positives and decreased ability for association studies to detect truly associated markers. In particular, the ability to detect modest genetic effects can be severely compromised. Second, a genetic association of a given single-nucleotide polymorphism as determined by univariate statistical analyses does not typically explain biologically interesting features, and often requires subsequent interpretation using a higher unit, such as a gene or region, for example, as defined by haplotype blocks. Third, missing genotypes in the data set and other data quality issues can pose challenges when comparisons across platforms and replications are planned. Finally, depending on the type of univariate analysis, computational burden can arise as the number of markers continues to grow into the millions. One way to deal with these and related challenges is to consider higher units for the analysis, such as genes or regions. This article summarizes analytical methods and strategies that have been proposed and applied by Group 16 to two genome-wide association data sets made available through the Genetic Analysis Workshop 16.
机译:随着近年来基因分型技术的飞速发展以及可用标记的数量不断增长,全基因组关联研究正在成为研究复杂疾病和特征的有前途的方法。但是,对此类数据的分析和解释存在一些挑战。首先,由于需要分析大量标记,因此存在大量的多重测试问题,导致假阳性风险增加,并且关联研究检测真正关联标记的能力降低。特别是,检测适度遗传效应的能力可能会严重受损。其次,通过单变量统计分析确定的给定单核苷酸多态性的遗传关联通常不能解释生物学上有趣的特征,并且通常需要随后使用更高单位进行解释,例如,如单倍型所定义的基因或区域块。第三,在计划跨平台比较和复制时,数据集中缺失的基因型和其他数据质量问题可能会带来挑战。最后,根据单变量分析的类型,随着标记数量继续增长到数百万,计算负担可能会增加。解决这些挑战和相关挑战的一种方法是考虑分析的更高单位,例如基因或区域。本文总结了第16组已经提出并将其应用到可通过遗传分析研讨会16获得的两个全基因组关联数据集的分析方法和策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号