首页> 外文会议>COCOON 2008;Annual International Conference on Computing and Combinatorics >Efficient Algorithms for SNP Haplotype Block Selection Problems
【24h】

Efficient Algorithms for SNP Haplotype Block Selection Problems

机译:SNP单倍型基因组选择问题的高效算法

获取原文

摘要

Global patterns of human DNA sequence variation (haplo-types) defined by common single nucleotide polymorphisms (SNPs) have important implications for identifying disease associations and human traits. Recent genetics research reveals that SNPs within certain haplotype blocks induce only a few distinct common haplotypes in the majority of the population. The existence of haplotype block structure has serious implications for association-based methods for the mapping of disease genes. Our ultimate goal is to select haplotype block designations that best capture the structure within the data. Here in this paper we propose several efficient combinatorial algorithms related to selecting interesting haplotype blocks under different diversity functions that generalizes many previous results in the literatures. In particular, given an m × n haplotype matrix A, we show linear time algorithms for finding all interval diversities, farthest sites, and the longest block within A. For selecting the multiple long blocks with diversity constraint, we show that selecting k blocks with longest total length can be be found in O(nk) time. We also propose linear time algorithms in calculating the all intra-longest-blocks and all intra-k-longest-blocks.
机译:由常见的单核苷酸多态性(SNP)定义的人类DNA序列变异(单倍型)的整体模式对于识别疾病关联和人类特征具有重要意义。最近的遗传学研究表明,某些单倍型模块内的SNP在大多数人群中仅诱导几种不同的常见单倍型。单倍型块结构的存在对疾病基因作图的基于关联的方法具有重要意义。我们的最终目标是选择能最好地捕获数据内结构的单倍型模块代号。在本文中,我们提出了几种与在不同分集函数下选择感兴趣的单倍型模块有关的有效组合算法,这些算法将文献中的许多先前结果归纳为一般性方法。特别地,给定一个m×n单倍型矩阵A,我们展示了线性时间算法,用于查找A中的所有区间分集,最远的位点和最长的块。对于选择具有多样性约束的多个长块,我们展示了选择具有最长总长度以O(nk)时间为单位。我们还提出了线性时间算法来计算所有帧内最长块和所有k内最长块。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号