A fast method for calculating reliable event supports in tree reconciliations via Pareto optimality

Thu-Hien To; Edwin Jacox; Vincent Ranwez; Celine Scornavacca


Abstract

Given a gene tree and a species tree, reconciliation methods attempt to retrieve the macro-evolutionary events that best explain the discrepancies between the two tree topologies. The DTL parsimonious approach searches for a most parsimonious reconciliation between a gene tree and a (dated) species tree, considering four possible macro-evolutionary events (speciation, duplication, transfer, and loss) with specific costs. Unfortunately, many events are erroneously predicted because of errors in the input trees, inappropriate input cost values, or the existence of several equally parsimonious scenarios. It is thus crucial to provide a measure of reliability for predicted events. It has recently been proposed that the reliability of an event can be estimated via its frequency in the set of most parsimonious reconciliations obtained using a variety of reasonable input cost vectors. To compute such a support, a straightforward but time-consuming approach is to generate cost vectors that depart slightly from the original one, independently compute the set of all most parsimonious reconciliations for each vector, and combine these sets a posteriori. Another proposed approach uses Pareto optimality to partition cost values into regions that induce reconciliations with the same number of DTL events. The support of an event is then defined as its frequency in the set of regions. However, the number of regions is often not large enough to provide reliable supports. We present here a method to efficiently compute event supports via a polynomial-sized graph, which can represent all reconciliations for several different costs. Moreover, two methods are proposed to take into account alternative input costs: either explicitly providing an input cost range or allowing a tolerance for the extra cost of a reconciliation.
Our methods are faster than the region-based method, substantially faster than the sampling-costs approach, and have a higher event-prediction accuracy on simulated data. We propose a new approach that improves the accuracy of event supports for parsimonious reconciliation methods by accounting for uncertainty in the input costs. Furthermore, because of their speed, our methods can be used on large gene families. Our algorithms are implemented in the ecceTERA program, freely available from http://mbb.univ-montp2.fr/MBB/ .
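
The notions of event support, Pareto optimality, and cost tolerance used above can be made concrete with a small toy example. The sketch below is not the ecceTERA algorithm and does not build the polynomial-sized graph; it simply enumerates a handful of hypothetical reconciliations by brute force. The `Reconciliation` record, the event labels such as "T@a", the cost vectors, and the choice to pool reconciliations with multiplicity across cost vectors are all assumptions made for illustration.

```python
from collections import Counter
from typing import NamedTuple


class Reconciliation(NamedTuple):
    """Hypothetical toy record: DTL event counts plus the events it contains."""
    dup: int
    transfer: int
    loss: int
    events: frozenset


def cost(rec, costs):
    """Parsimony cost of a reconciliation under a (dup, transfer, loss) cost vector."""
    d, t, l = costs
    return d * rec.dup + t * rec.transfer + l * rec.loss


def pareto_optimal(recs):
    """Keep reconciliations whose event-count vector is not dominated by another."""
    def dominates(a, b):
        ca, cb = a[:3], b[:3]
        return ca != cb and all(x <= y for x, y in zip(ca, cb))
    return [r for r in recs if not any(dominates(o, r) for o in recs)]


def event_supports(recs, cost_vectors, tolerance=0):
    """Frequency of each event among reconciliations whose cost is within
    `tolerance` of the optimum, pooled over all cost vectors (an assumed
    way of combining the per-vector sets)."""
    counts = Counter()
    total = 0
    for costs in cost_vectors:
        best = min(cost(r, costs) for r in recs)
        for r in recs:
            if cost(r, costs) <= best + tolerance:
                counts.update(r.events)
                total += 1
    return {event: n / total for event, n in counts.items()}


# Toy data: four made-up reconciliations of the same gene tree.
recs = [
    Reconciliation(2, 0, 3, frozenset({"D@a", "D@b"})),
    Reconciliation(0, 1, 1, frozenset({"T@a"})),
    Reconciliation(1, 1, 0, frozenset({"D@a", "T@a"})),
    Reconciliation(2, 1, 3, frozenset({"D@a", "D@b", "T@c"})),  # dominated by the first
]
# Made-up (duplication, transfer, loss) cost vectors around a default of (2, 3, 1).
cost_vectors = [(2, 3, 1), (2, 4, 1), (1, 3, 2)]

if __name__ == "__main__":
    print(len(pareto_optimal(recs)))
    print(event_supports(recs, cost_vectors))
    print(event_supports(recs, cost_vectors, tolerance=1))
```

In this toy setting, raising `tolerance` admits near-optimal reconciliations into the pool, which changes the support of events that appear only in some of them. Explicit enumeration like this grows quickly with tree size, which is the cost the graph-based method described above is meant to avoid.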


Bibliographic record

Source: BMC Bioinformatics, 2015, issue 1
Authors: Thu-Hien To; Edwin Jacox; Vincent Ranwez; Celine Scornavacca
Original format: PDF
CLC classification: Biological sciences

