Perfect phylogeny and haplotype assignment

Abstract

This paper is concerned with the reconstruction of perfect phylogenies from binary character data with missing values, and with the related problems of inferring complete haplotypes from haplotypes or genotypes with missing data. In cases where the problems considered are NP-hard, we assume a rich data hypothesis under which they become tractable. Natural probabilistic models are introduced for the generation of character vectors, haplotypes, or genotypes with missing data, and it is shown that these models support the rich data hypothesis. The principal results include:

A near-linear time algorithm for inferring a perfect phylogeny from binary character data (or haplotype data) with missing values, under the rich data hypothesis;
A quadratic-time algorithm for inferring a perfect phylogeny from genotype data with missing values with high probability, under certain distributional assumptions;
Demonstration that the problems of maximum-likelihood inference of complete haplotypes from partial haplotypes or partial genotypes can be cast as minimum-entropy disjoint set cover problems;
In the case where the haplotypes come from a perfect phylogeny, a representation of the set cover problem as minimum-entropy covering of subtrees of a tree by nodes;
An exact algorithm for minimum-entropy subtree covering, and demonstration that it runs in polynomial time when the subtrees have small diameter;
Demonstration that a simple greedy approximation algorithm solves the minimum-entropy subtree covering problem with relative error tending to zero when the number of partial haplotypes per complete haplotype is large;
An asymptotically consistent method of estimating the frequencies of the complete haplotypes in a perfect phylogeny, under an iid model for the distribution of missing data;
Computational results on real data demonstrating the effectiveness of the greedy algorithm for inferring haplotypes from genotypes with missing data, even in the absence of a perfect phylogeny.
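
To make the set-cover view of haplotype inference concrete, here is a minimal Python sketch of a greedy cover and of the entropy objective it targets. It is an illustration under stated assumptions, not the authors' algorithm: the abstract does not spell out the greedy procedure, so the standard greedy set-cover rule is used in its place. The names `MISSING`, `compatible`, `entropy`, and `greedy_cover` are hypothetical, and the sketch assumes that a list of candidate complete haplotypes is supplied as input and that missing sites are written as `?`.

```python
from collections import Counter
from math import log2

MISSING = "?"  # assumed marker for a missing site


def compatible(partial, complete):
    """True if `complete` agrees with `partial` at every observed site."""
    return all(p == MISSING or p == c for p, c in zip(partial, complete))


def entropy(counts):
    """Shannon entropy, in bits, of the distribution given by the counts."""
    counts = list(counts)
    total = sum(counts)
    return -sum((c / total) * log2(c / total) for c in counts if c > 0)


def greedy_cover(partials, candidates):
    """Assign each partial haplotype to exactly one compatible candidate.

    Standard greedy set-cover rule: repeatedly pick the candidate that is
    compatible with the largest number of still-unassigned partial
    haplotypes, and assign all of those to it.
    """
    unassigned = set(range(len(partials)))
    assignment = {}
    while unassigned:
        best = max(
            candidates,
            key=lambda h: sum(compatible(partials[i], h) for i in unassigned),
        )
        covered = {i for i in unassigned if compatible(partials[i], best)}
        if not covered:
            raise ValueError("a partial haplotype has no compatible candidate")
        for i in covered:
            assignment[i] = best
        unassigned -= covered
    return assignment


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    partials = ["0?1", "01?", "?11", "10?", "1?0"]
    candidates = ["011", "100", "001", "110"]
    result = greedy_cover(partials, candidates)
    class_sizes = Counter(result.values())
    print(result)
    print(class_sizes, entropy(class_sizes.values()))
```

Because every partial haplotype is assigned to exactly one candidate, the resulting classes are disjoint, and `entropy` applied to the class sizes gives the quantity that a minimum-entropy cover seeks to minimize. The greedy rule favors a few large classes, which is what keeps that entropy low.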

Bibliographic details

Source
International conference on Computational molecular biology; Annual international conference on Computational molecular biology | 2004 | pp. 10-19 | 10 pages

Authors
Eran Halperin; Richard M. Karp

Original format: PDF

Chinese Library Classification: Computing technology, computer technology

Keywords
phasing
