An Efficient Algorithm for Haplotype Inference on Pedigrees with a Small Number of Recombinants

Abstract

Combinatorial (or rule-based) methods for inferring haplotypes from genotypes on a pedigree have been studied extensively in the recent literature. These methods generally try to reconstruct the haplotypes of each individual so that the total number of recombinants in the pedigree is minimized. The problem is NP-hard, although it is known that the number of recombinants in a practical dataset is usually very small. In this paper, we consider the question of how to efficiently infer haplotypes on a large pedigree when the number of recombinants is bounded by a small constant, i.e., the so-called k-recombinant haplotype configuration (k-RHC) problem. We introduce a simple probabilistic model for k-RHC where the prior haplotype probability of a founder and the haplotype transmission probability from a parent to a child are all assumed to follow the uniform distribution, and k random recombinants are assumed to have taken place uniformly and independently in the pedigree. We present an $O(mn(\log n)^{k+1})$ time algorithm for k-RHC on tree pedigrees without mating loops, where m is the number of loci and n is the size of the input pedigree, and prove that when $90 \log n < m < n^3$, the algorithm can correctly find a feasible haplotype configuration that obeys the Mendelian law of inheritance and requires no more than k recombinants with probability $1 - O(k^2 (\log n)^2/m + 1/n^2)$. The algorithm is efficient when k is of a moderate value and could thus be used to infer haplotypes from genotypes on large tree pedigrees efficiently in practice. We have implemented the algorithm as a C++ program named TREE-K-RHC. The implementation incorporates several ideas for dealing effectively with missing data and with data containing a large number of recombinants. Our experimental results on both simulated and real datasets show that TREE-K-RHC can reconstruct haplotypes with high accuracy and is much faster than the best combinatorial method in the literature.
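To make the notion of a recombinant concrete, the following is a minimal Python sketch that counts the fewest recombination events needed for one parent-to-child transmission. It is not the paper's algorithm and is unrelated to the TREE-K-RHC code; the function name `min_recombinants` and the 0/1 allele encoding are assumptions made purely for illustration. It assumes the parent's two haplotypes and the child's inherited haplotype are already known, which is exactly the information that haplotype inference has to reconstruct.

```python
from typing import Optional, Sequence


def min_recombinants(parent_hap_a: Sequence[int],
                     parent_hap_b: Sequence[int],
                     child_hap: Sequence[int]) -> Optional[int]:
    """Fewest switches between the parent's two haplotypes that yield child_hap.

    Returns None if some locus of child_hap matches neither parental
    haplotype, i.e. the transmission violates Mendelian inheritance.
    (Hypothetical helper for illustration only.)
    """
    if not (len(parent_hap_a) == len(parent_hap_b) == len(child_hap)):
        raise ValueError("haplotypes must cover the same loci")

    inf = float("inf")
    # cost_a / cost_b: fewest switches so far if the current locus is
    # copied from haplotype a / haplotype b. Starting on either is free.
    cost_a, cost_b = 0, 0
    for a, b, c in zip(parent_hap_a, parent_hap_b, child_hap):
        new_a = min(cost_a, cost_b + 1) if a == c else inf
        new_b = min(cost_b, cost_a + 1) if b == c else inf
        cost_a, cost_b = new_a, new_b

    best = min(cost_a, cost_b)
    return None if best == inf else int(best)


if __name__ == "__main__":
    # One switch from haplotype a to haplotype b between loci 2 and 3.
    print(min_recombinants([0, 0, 0, 0], [1, 1, 1, 1], [0, 0, 1, 1]))
```

Summing this count over every parent-child pair of a candidate haplotype configuration gives the total number of recombinants that the k-RHC problem bounds by k.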

Bibliographic record

Source: Annual European Symposium on Algorithms, 2009, 12 pages
Authors: Jing Xiao; Tiancheng Lou; Tao Jiang
Original format: PDF
Chinese Library Classification: TP3-53
Keywords: Computational biology; Haplotype inference; Pedigree; Recombination; Combinatorial algorithm; Probabilistic model


Similar literature

1. An Efficient Algorithm for Haplotype Inference on Pedigrees with a Small Number of Recombinants [J]. Jing Xiao, Tiancheng Lou, Tao Jiang. Algorithmica, 2012, issue 3a4.
2. Efficient Algorithms for Reconstructing Zero-Recombinant Haplotypes on a Pedigree Based on Fast Elimination of Redundant Linear Equations [J]. Jing Xiao, Lan Liu, Lirong Xia. SIAM Journal on Computing, 2009, issue 6.
3. An Efficient Algorithm for Haplotype Inference on Pedigrees with Recombinations and Mutations [J]. Yuri Pirola, Paola Bonizzoni, Tao Jiang. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2012, issue 1.
4. An Efficient Algorithm for Haplotype Inference on Pedigrees with a Small Number of Recombinants [C]. Jing Xiao, Tiancheng Lou, Tao Jiang. Algorithms - ESA 2009, 2009.
5. Algorithms for Inferring Haplotypes from Genotype Data of Pedigrees [D]. Doan, Duong Dai. 2011.
6. alleHap: an efficient algorithm to reconstruct zero-recombinant haplotypes from parent-offspring pedigrees [O]. Nathan Medina-Rodríguez, Angelo Santana, Ana M Wägner. 2014.
7. An Efficient Algorithm for Haplotype Inference on Pedigrees with a Small Number of Recombinants [O]. Jing Xiao, Tiancheng Lou, Tao Jiang. 2011.
