A Nearly Linear-Time General Algorithm for Genome-Wide Bi-allele Haplotype Phasing

机译：基因组宽双等位基因单倍型逐步算法的几乎线性时间常规算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The determination of feature maps, such as STSs (sequence tag sites), SNPs (single nucleotide polymorphisms) or RFLP (restric-tion fragment length polymorphisms) maps, for each chromosome copy or haplotype in an individual has important potential applications to ge-netics, clinical biology and association studies. Wo consider the problem of reconstructing two haplotypes of a diploid individual from genotype data generated by mapping experiments, and present an algorithm to i-ecover haplotypes. The problem of optimizing existing methods of SNP jpliasing with a population of diploid genotypes has been investigated in [V] and found to be NP-hard. In contrast, using single molecule methods, we show that although haplotypes are not known and data are further confounded by the mapping error model, reasonable assumptions on the mapping process allow us to recover the co-associations of allele types across consecutive loci and estimate the haplotypes with an efficient al-gorithm. The haplotype reconstruction algorithm requires two stages: Stage I is the detection of polymorphic marker types, this is clone by ixiodifying an EM-algorithm for Gaussian mixture models and an exam-ple is given for RFLP sizing. Stage II focuses on the problem of phasing and presents a method of local maximum likelihood for the inference of laaplotypes in an individual. The algorithm presented is nearly linear in ttie number of polymorphic loci. The algorithm results, run on simulated R.FLP sizing data, are encouraging, and suggest that the method will prove practical for haplotype phasing.

机译：特征图的测定，例如STS（序列标记位点），SNPS（单核苷酸多态性）或RFLP（恢raction碎片长度多态性）图，每个染色体拷贝或单倍型在个人中具有重要的GE-Netics潜在应用，临床生物学与关联研究。 WO考虑从通过映射实验生成的基因型数据重建二倍体的两个单倍型的问题，并将算法呈现给I-Ecover单倍型。 [V]研究了优化具有二倍体基因型群的SNP JPLiasing现有方法的问题，发现是NP - 硬。相比之下，使用单分子方法，我们表明，虽然单倍型不是已知的并且数据进一步混淆了映射误差模型，但映射过程的合理假设允许我们在连续基因座上恢复等位基因类型的共同关联并估计具有高效Al-Gorithm的单倍型。单倍型重建算法需要两个阶段：阶段I是检测多态标记类型，这是通过iximizED的IximizED用于高斯混合模型的EM算法，对RFLP施加给出了考试。第二阶段专注于阶段的问题，并提出了一种局部最大可能性的方法，可以在个人中推断出来。呈现的算法几乎是线性的多态基因座的数量。算法结果，在模拟的R.FlP尺寸数据上运行，令人鼓舞，并表明该方法将证明单倍型相位进行实用。

著录项

来源
《International Conference on High Performance Computing》|2003年||共12页
会议地点
作者
Will Casey; Bud Mishra;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP302-532;
关键词

相似文献

外文文献
中文文献
专利

1. A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree [J] . En-Yu Lai, Wei-Bung Wang, Tao Jiang, BMC Bioinformatics . 2012,第SUPPLEMENTa17期

机译：在谱系上重建零重组单倍型构型的线性时间算法
2. A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on pedigrees without mating loops [J] . Liu L, Jiang T Journal of combinatorial optimization . 2010,第2期

机译：线性时间算法，用于在没有交配环的情况下在谱系上重建零重组单倍型构型
3. A Linear-Time Algorithm for the Perfect Phylogeny Haplotype Problem [J] . Paola Bonizzoni Algorithmica . 2007,第3期

机译：完美系统发育单倍型问题的线性时间算法
4. A Nearly Linear-Time General Algorithm for Genome-Wide Bi-allele Haplotype Phasing [C] . Will Casey, Bud Mishra International Conference on High Performance Computing . 2003

机译：基因组宽双等位基因单倍型逐步算法的几乎线性时间常规算法
5. Linear-time algorithms for dominators and related problems. [D] . Georgiadis, Loukas. 2005

机译：用于支配者和相关问题的线性时间算法。
6. A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree [O] . En-Yu Lai, Wei-Bung Wang, Tao Jiang, 2012

机译：在谱系上重建零重组单倍型构型的线性时间算法
7. A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree [O] . Lai En-Yu, Wang Wei-Bung, Jiang Tao, 2012

机译：在谱系上重建零重组单倍型构型的线性时间算法

A Nearly Linear-Time General Algorithm for Genome-Wide Bi-allele Haplotype Phasing

摘要

著录项

相似文献

相关主题

期刊订阅