首页> 外文期刊>BMC Genetics >Computation of haplotypes on SNPs subsets: advantage of the
【24h】

Computation of haplotypes on SNPs subsets: advantage of the

机译:单核苷酸多态性子集的单倍型计算:优势

获取原文
       

摘要

Background Genetic association studies aim at finding correlations between a disease state and genetic variations such as SNPs or combinations of SNPs, termed haplotypes. Some haplotypes have a particular biological meaning such as the ones derived from SNPs located in the promoters, or the ones derived from non synonymous SNPs. All these haplotypes are "subhaplotypes" because they refer only to a part of the SNPs found in the gene. Until now, subhaplotypes were directly computed from the very SNPs chosen to constitute them, without taking into account the rest of the information corresponding to the other SNPs located in the gene. In the present work, we describe an alternative approach, called the "global method", which takes into account all the SNPs known in the region and compare the efficacy of the two "direct" and "global" methods. Results We used empirical haplotypes data sets from the GH1 promoter and the APOE gene, and 10 simulated datasets, and randomly introduced in them missing information (from 0% up to 20%) to compare the 2 methods. For each method, we used the PHASE haplotyping software since it was described to be the best. We showed that the use of the "global method" for subhaplotyping leads always to a better error rate than the classical direct haplotyping. The advantage provided by this alternative method increases with the percentage of missing genotyping data (diminution of the average error rate from 25% to less than 10%). We applied the global method software on the GRIV cohort for AIDS genetic associations and some associations previously identified through direct subhaplotyping were found to be erroneous. Conclusion The global method for subhaplotyping can reduce, sometimes dramatically, the error rate on patient resolutions and haplotypes frequencies. One should thus use this method in order to minimise the risk of a false interpretation in genetic studies involving subhaplotypes. In practice the global method is always more efficient than the direct method, but a combination method taking into account the level of missing information in each subject appears to be even more interesting when the level of missing information becomes larger (>10%).
机译:背景技术遗传关联研究旨在寻找疾病状态与遗传变异(例如单核苷酸多态性或单核苷酸多态性的组合)之间的相关性。一些单倍型具有特定的生物学意义,例如源自位于启动子中的SNP的单倍型,或源自非同义SNP的单倍型。所有这些单倍型都是“亚单倍型”,因为它们仅指基因中发现的SNP的一部分。到目前为止,亚单体型是直接从选择构成它们的SNP中直接算出的,而没有考虑与基因中其他SNP相对应的其余信息。在当前的工作中,我们描述了一种称为“全局方法”的替代方法,该方法考虑了该地区已知的所有SNP,并比较了“直接”和“全局”两种方法的功效。结果我们使用来自GH1启动子和APOE基因的经验单倍型数据集以及10个模拟数据集,并随机引入缺失信息(从0%到20%)以比较这两种方法。对于每种方法,我们都使用PHASE单体型软件,因为它被认为是最好的。我们表明,使用“全局方法”进行亚谱图分析始终会比传统的直接单体型方法产生更好的错误率。这种替代方法提供的优势随着基因分型数据丢失的百分比的增加(平均错误率从25%降低到小于10%)而增加。我们在GRIV队列中将全球方法软件应用于AIDS遗传关联,发现以前通过直接亚谱法鉴定的一些关联是错误的。结论整体亚谱分析方法可以有时甚至显着降低患者分辨率和单倍型频率的错误率。因此,应该使用这种方法,以最大程度地减少涉及亚单元型的遗传研究中错误解释的风险。在实践中,全局方法总是比直接方法更有效,但是当丢失信息的级别变得更大(> 10%)时,考虑到每个主题中丢失信息的级别的组合方法显得更加有趣。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号