首页> 外文会议>International workshop on algorithms in bioinformatics >Practical Algorithms and Fixed-Parameter Tractability for the Single Individual SNP Haplotyping Problem
【24h】

Practical Algorithms and Fixed-Parameter Tractability for the Single Individual SNP Haplotyping Problem

机译:单个单独的SNP单倍型问题的实用算法和固定参数遗传性

获取原文

摘要

Single nucleotide polymorphisms (SNPs) are the most frequent form of human genetic variation, of foremost importance for a variety of applications including medical diagnostic, phylogenies and drug design. The complete SNPs sequence information from each of the two copies of a given chromosome in a diploid genome is called a haplotype. The Hyplotyping Problem for a single individual is as follows: given a set of fragments from one individual's DNA, find a maximally consistent pair of SNPs haplotypes (one per chromosome copy) by removing data "errors" related to sequencing errors, repeats, and paralogous recruitment. Two versions of the problem, i.e. the Minimum Fragment Removal (MFR) and the Minimum SNP Removal (MSR), are considered. The Haplotyping Problem was introduced in [8], where it was proved that both MSR and MFR are polynomially solvable when each fragment covers a set of consecutive SNPs (i.e., it is a gapless fragment), and NP-hard in general. The original algorithms of [8] are of theoretical interest, but by no means practical. In fact, one relies on finding the maximum stable set in a perfect graph, and the other is a reduction to a network flow problem. Furthermore, the reduction does not work when there are fragments completely included in others, and neither algorithm can be generalized to deal with a bounded total number of holes in the data. In this paper, we give the first practical algorithms for the Haplotyping Problem, based on Dynamic Programming. Our algorithms do not require the fragments to not include each other, and are polynomial for each constant k bounding the total number of holes in the data. For m SNPs and n fragments, we give an O(mn~(2k+2)) algorithm for the MSR problem, and an O(2~(2k)m~2n + 2~(3k)m~3) algorithm for the MFR problem, when each fragment has at most k holes. In particular, we obtain an O(mn~2) algorithm for MSR and an (O(m~2n + m~3) algorithm for MFR on gapless fragments. Finally, we prove that both MFR and MSR are APX-hard in general.
机译:单核苷酸多态性(SNP)是最常见的人类遗传变异形式,最重要的是各种应用,包括医疗诊断,文学和药物设计。来自二倍体基因组中给定染色体的两个拷贝中的每一个的完整SNP序列信息称为单倍型。单个个体的闭型问题如下:给定一组来自​​一个个体的DNA的片段,通过去除与测序误差,重复和递质相关的数据“误差”找到最大一致的SNPS单倍型(每个染色体拷贝)招聘。考虑了两个问题的问题,即最小分段删除(MFR)和最小SNP移除(MSR)。在[8]中引入了单倍分型问题,其中证明了当每个片段覆盖一组连续的SNP时,MSR和MFR都是多项式可溶性的,并且一般是NP - 硬质的。 [8]的原始算法具有理论兴趣,但绝不是实际的。事实上,一个依赖于找到完美图中的最大稳定集,另一个是对网络流动问题的降低。此外,当存在完全包括在其他片段时,减少不起作用,并且既不能算法都可以广泛地处理数据中的界限总数。在本文中,我们基于动态编程,给出了单倍型问题的第一个实用算法。我们的算法不需要碎片不包括彼此,并且对于每个常数K限定数据中的总孔数的多项式。对于M SNP和N片段,我们为MSR问题提供O(2K + 2))算法,以及O(2〜(2k)m〜2n + 2〜(3k)m〜3)算法当每个片段最多的k孔时,MFR问题。特别地,我们获得了MSR的O(MN〜2)算法和用于在无间隙碎片上的MFR的(O(M〜2n + M〜3)算法。最后,我们证明了MFR和MSR一般都是APX艰难的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号