首页> 外文会议>Algorithms - ESA 2009 >An Efficient Algorithm for Haplotype Inference on Pedigrees with a Small Number of Recombinants

【24h】

An Efficient Algorithm for Haplotype Inference on Pedigrees with a Small Number of Recombinants

机译：一种带有少量重组子的谱系单倍型推断的高效算法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Combinatorial (or rule-based) methods for inferring haplotypes from genotypes on a pedigree have been studied extensively in the recent literature. These methods generally try to reconstruct the haplotypes of each individual so that the total number of recombinants is minimized in the pedigree. The problem is NP-hard, although it is known that the number of recombinants in a practical dataset is usually very small. In this paper, we consider the question of how to efficiently infer haplotypes on a large pedigree when the number of recombinants is bounded by a small constant, i.e. the so called k-recombinant haplotype configuration (k-RHC) problem. We introduce a simple probabilistic model for k-RHC where the prior haplotype probability of a founder and the haplotype transmission probability from a parent to a child are all assumed to follow the uniform distribution and k random recombinants are assumed to have taken place uniformly and independently in the pedigree. We present an O(mn log~(k+1) n) time algorithm for k-RHC on tree pedigrees without mating loops, where m is the number of loci and n is the size of the input pedigree, and prove that when 90 log re < m < n~3, the algorithm can correctly find a feasible haplotype configuration that obeys the Mendelian law of inheritance and requires no more than k recombinants withrnprobability 1 - O(k~2 log~2n/mn + 1~2). The algorithm is efficient when k is of a moderate value and could thus be used to infer haplotypes from genotypes on large tree pedigrees efficiently in practice. We have implemented the algorithm as a C++ program named Tree-k-RHC. The implementation incorporates several ideas for dealing with missing data and data with a large number of recombinants effectively. Our experimental results on both simulated and real datasets show that TREE-k-RHC can reconstruct haplotypes with a high accuracy and is much faster than the best combinatorial method in the literature.

机译：在最近的文献中已经广泛研究了从谱系上的基因型推导单倍型的组合（或基于规则）方法。这些方法通常尝试重建每个个体的单倍型，以使重组的总数在谱系中最小化。尽管已知实用数据集中的重组体数量通常很少，但是问题是NP困难的。在本文中，我们考虑了以下问题：当重组子的数量由一个小的常数限制时，即所谓的k重组单倍型构型（k-RHC）问题，如何在一个大的谱系上有效地推断单倍型。我们为k-RHC引入一个简单的概率模型，其中假定创始人的先前单倍型概率和从父母到孩子的单倍型传播概率均遵循均匀分布，并且假设k个随机重组体均发生且独立发生在血统书中。我们提出了不带交配环的树谱系上k-RHC的O（mn log〜（k + 1）n）时间算法，其中m是基因座数，n是输入谱系的大小，并证明当90 log re

著录项

来源
《Algorithms - ESA 2009》|2009年|325-336|共12页
会议地点 Copenhagen(DK);Copenhagen(DK);Copenhagen(DK)
作者
Jing Xiao; Tiancheng Lou; Tao Jiang;
展开▼
作者单位

IBM China Research Lab, Beijing, China;

Department of Computer Science and Technology, Tsinghua University, Beijing, China;

Department of Computer Science and Engineering, University of California, Riverside, CA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
computational biology; haplotype inference; pedigree; recombina tion; combinatorial algorithm; probabilistic model;

机译：计算生物学；单倍型推断谱系;重组组合算法概率模型;

相似文献

外文文献
中文文献
专利

1. An Efficient Algorithm for Haplotype Inference on Pedigrees with a Small Number of Recombinants [J] . Jing Xiao, Tiancheng Lou, Tao Jiang Algorithmica . 2012,第3a4期

机译：一种带有少量重组子的谱系单倍型推断的高效算法
2. EFFICIENT ALGORITHMS FOR RECONSTRUCTING ZERO-RECOMBINANT HAPLOTYPES ON A PEDIGREE BASED ON FAST ELIMINATION OF REDUNDANT LINEAR EQUATIONS [J] . JING XIAO, LAN LIU, LIRONG XIA, SIAM Journal on Computing . 2009,第6期

机译：基于冗余线性方程组快速消去的，在谱系上重建零重组单型的有效算法
3. An Efficient Algorithm for Haplotype Inference on Pedigrees with Recombinations and Mutations [J] . Pirola Yuri, Bonizzoni Paola, Jiang Tao Computational Biology and Bioinformatics, IEEE/ACM Transactions on . 2012,第1期

机译：具有重组和变异的家系的单倍型推断的高效算法
4. An Efficient Algorithm for Haplotype Inference on Pedigrees with a Small Number of Recombinants [C] . Jing Xiao, Tiancheng Lou, Tao Jiang Annual European Symposium on Algorithms . 2009

机译：具有少量重组剂的二倍型推理的高效算法
5. Algorithms for Inferring Haplotypes from Genotype Data of Pedigrees. [D] . Doan, Duong Dai. 2011

机译：从谱系基因型数据推断单倍型的算法。
6. alleHap: an efficient algorithm to reconstruct zero-recombinant haplotypes from parent-offspring pedigrees [O] . Nathan Medina-Rodríguez, Angelo Santana, Ana M Wägner, 2014

机译：alleHap：一种有效的算法可从亲代后代谱系中重建零重组单倍型
7. An Efficient Algorithm for Haplotype Inference on Pedigrees with a Small Number of Recombinants [O] . Jing Xiao, Tiancheng Lou, Tao Jiang 2011

机译：带有少量重组子的谱系的单倍型推断的高效算法

An Efficient Algorithm for Haplotype Inference on Pedigrees with a Small Number of Recombinants

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅