首页> 外文会议>The 7th Asia-Pacific Bioinformatics Conference(第七届亚太生物信息学大会) >CGTS: a site-clustering graph based tagSNP selection algorithm in genotype data
【24h】

CGTS: a site-clustering graph based tagSNP selection algorithm in genotype data

机译:CGTS:基因型数据中基于站点聚类图的tagSNP选择算法

获取原文

摘要

Background: Recent studies have shown genetic variation is the basis of the genome-wide disease association research. However, due to the high cost on genotyping large number of single nucleotide polymorphisms (SNPs), it is essential to choose a small subset of informative SNPs (tagSNPs), which are able to capture most variation in a population, to represent the rest SNPs. Several methods have been proposed to find the minimum set of tagSNPs, but most of them still have some disadvantages such as information loss and block-partition limit.Results: This paper proposes a new hybrid method named CGTS which combines the ideas of the clustering and the graph algorithms to select tagSNPs on genotype data. This method aims to maximize the number of the discarding nontagSNPs in the given set. CGTS integrates the information of the LD association and the genotype diversity using the site graphs, discards redundant SNPs using the algorithm based on these graph structures. The clustering algorithm is used to reduce the running time of CGTS. The efficiency of the algorithm and quality of solutions are evaluated on biological data and the comparisons with three popular selecting methods are shown in the paper.Conclusions: Our theoretical analysis and experimental results show that our algorithm CGTS is not only more efficient than other methods but also can be get higher accuracy in tagSNP selection.
机译:背景:最近的研究表明,遗传变异是全基因组疾病关联研究的基础。但是,由于对大量单核苷酸多态性(SNP)进行基因分型的成本很高,因此必须选择一小部分信息丰富的SNP(tagSNPs),这些子集能够捕获种群中的大多数变异,以代表其余的SNP。 。已经提出了几种方法来寻找最小的tagSNP集,但是大多数方法仍然存在诸如信息丢失和块分割限制之类的缺点。结果:本文提出了一种新的名为CGTS的混合方法,该方法结合了聚类和聚类的思想。图算法选择基因型数据上的tagSNP。该方法旨在最大化给定集合中丢弃的nontagSNP的数量。 CGTS使用站点图整合了LD关联和基因型多样性的信息,并使用基于这些图结构的算法丢弃了多余的SNP。聚类算法用于减少CGTS的运行时间。结合生物学数据对算法的效率和解的质量进行了评价,并与三种常用的选择方法进行了比较。结论:理论分析和实验结果表明,CGTS算法不仅比其他方法更有效,而且算法更有效。在tagSNP选择中也可以获得更高的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号