...
首页> 外文期刊>Molecular phylogenetics and evolution >Hide and seek: Placing and finding an optimal tree for thousands of homoplasy-rich sequences
【24h】

Hide and seek: Placing and finding an optimal tree for thousands of homoplasy-rich sequences

机译:捉迷藏:为成千上万的同质性丰富的序列放置并找到最佳树

获取原文
获取原文并翻译 | 示例

摘要

Finding optimal evolutionary trees from sequence data is typically an intractable problem, and there is usually no way of knowing how close to optimal the best tree from some search truly is. The problem would seem to be particularly acute when we have many taxa and when that data has high levels of homoplasy, in which the individual characters require many changes to fit on the best tree. However, a recent mathematical result has provided a precise tool to generate a short number of high-homoplasy characters for any given tree, so that this tree is provably the optimal tree under the maximum parsimony criterion. This provides, for the first time, a rigorous way to test tree search algorithms on homoplasy-rich data, where we know in advance what the 'best' tree is. In this short note we consider just one search program (TNT) but show that it is able to locate the globally optimal tree correctly for 32,768 taxa, even though the characters in the dataset require, on average, 1148 state-changes each to fit on this tree, and the number of characters is only 57.
机译:从序列数据中找到最佳进化树通常是一个棘手的问题,并且通常没有办法知道从某些搜索中找到的最佳进化树到底有多接近。当我们有许多类群并且该数据具有高水平的同质性时,该问题似乎特别严重,其中单个字符需要进行许多更改才能适合最佳树。但是,最近的数学结果提供了一种精确的工具,可以为任何给定的树生成少量的高同形字符,因此该树在最大简约准则下可证明是最优树。这首次提供了一种严格的方法来测试富含同质异构数据的树搜索算法,在此我们可以预先知道“最佳”树是什么。在本简短说明中,我们仅考虑一个搜索程序(TNT),但表明即使数据集中的字符平均每个字符需要1148个状态更改才能适合32,768个分类单元正确定位全局最优树。这棵树,字符数只有57。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号