首页> 外文期刊>Bioinformatics >Improving the efficiency of SPR moves in phylogenetic tree search methods based on maximum likelihood.
【24h】

Improving the efficiency of SPR moves in phylogenetic tree search methods based on maximum likelihood.

机译:在基于最大似然的系统树搜索方法中提高SPR移动的效率。

获取原文
获取原文并翻译 | 示例
           

摘要

MOTIVATION: Maximum likelihood (ML) methods have become very popular for constructing phylogenetic trees from sequence data. However, despite noticeable recent progress, with large and difficult datasets (e.g. multiple genes with conflicting signals) current ML programs still require huge computing time and can become trapped in bad local optima of the likelihood function. When this occurs, the resulting trees may still show some of the defects (e.g. long branch attraction) of starting trees obtained using fast distance or parsimony programs. METHODS: Subtree pruning and regrafting (SPR) topological rearrangements are usually sufficient to intensively search the tree space. Here, we propose two new methods to make SPR moves more efficient. The first method uses a fast distance-based approach to detect the least promising candidate SPR moves, which are then simply discarded. The second method locally estimates the change in likelihood for any remaining potential SPRs, as opposed to globally evaluating the entire tree for each possible move. These two methods are implemented in a new algorithm with a sophisticated filtering strategy, which efficiently selects potential SPRs and concentrates most of the likelihood computation on the promising moves. RESULTS: Experiments with real datasets comprising 35-250 taxa show that, while indeed greatly reducing the amount of computation, our approach provides likelihood values at least as good as those of the best-known ML methods so far and is very robust to poor starting trees. Furthermore, combining our new SPR algorithm with local moves such as PHYML's nearest neighbor interchanges, the time needed to find good solutions can sometimes be reduced even more.
机译:动机:最大似然(ML)方法已非常受欢迎,可用于从序列数据构建系统发育树。但是,尽管最近取得了显着进展,但是对于大型且困难的数据集(例如具有冲突信号的多个基因),当前的ML程序仍然需要大量的计算时间,并且可能陷入似然函数的不良局部最优中。发生这种情况时,生成的树可能仍显示出使用快速距离或简约程序获得的起始树的​​某些缺陷(例如,长的树枝吸引)。方法:子树修剪和移植(SPR)拓扑重排通常足以密集搜索树空间。在这里,我们提出了两种新方法来提高SPR的移动效率。第一种方法使用基于快速距离的方法来检测最不可能的候选SPR移动,然后将其简单地丢弃。第二种方法是本地估计任何剩余潜在SPR的可能性变化,这与针对每个可能的移动全局评估整棵树相反。这两种方法是在具有复杂过滤策略的新算法中实现的,该算法可以有效地选择潜在的SPR,并将大部分似然计算集中在有希望的动作上。结果:对包含35-250个分类单元的真实数据集进行的实验表明,尽管确实大大减少了计算量,但我们的方法提供的似然值至少与迄今为止最著名的ML方法的似然值相同,并且对于不良的开始非常鲁棒树木。此外,将我们的新SPR算法与诸如PHYML的最近邻居互换之类的本地移动方法相结合,有时甚至可以减少寻找更好解决方案所需的时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号