Journal: Bioinformatics

DACTAL: divide-and-conquer trees (almost) without alignments


Abstract

Motivation: While phylogenetic analyses of datasets containing 1000–5000 sequences are challenging for existing methods, the estimation of substantially larger phylogenies poses a problem of much greater complexity and scale.

Methods: We present DACTAL, a method for phylogeny estimation that produces trees from unaligned sequence datasets without ever needing to estimate an alignment on the entire dataset. DACTAL combines iteration with a novel divide-and-conquer approach, so that each iteration begins with a tree produced in the prior iteration, decomposes the taxon set into overlapping subsets, estimates trees on each subset, and then combines the smaller trees into a tree on the full taxon set using a new supertree method. We prove that DACTAL is guaranteed to produce the true tree under certain conditions. We compare DACTAL to SATé and maximum likelihood trees on estimated alignments using simulated and real datasets with 1000–27 643 taxa.

Results: Our studies show that on average DACTAL yields more accurate trees than the two-phase methods we studied on very large datasets that are difficult to align, and has approximately the same accuracy on the easier datasets. The comparison to SATé shows that both have the same accuracy, but that DACTAL achieves this accuracy in a fraction of the time. Furthermore, DACTAL can analyze larger datasets than SATé, including a dataset with almost 28 000 sequences.

Availability: DACTAL source code and results of dataset analyses are available at .
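The iterate, decompose, estimate, and merge cycle described under Methods can be illustrated with a minimal control-flow sketch. This is not the DACTAL implementation: the function names, the sliding-window decomposition, and the toy "tree" (a plain tuple of taxon names) are all hypothetical stand-ins chosen only to make the loop runnable. A real analysis would plug in an actual tree-based decomposition, an alignment and tree estimator for each subset, and a supertree method, none of which the abstract specifies in detail.

```python
from typing import Callable, List, Sequence, Set, Tuple

Taxon = str
ToyTree = Tuple[Taxon, ...]  # hypothetical placeholder: an ordering of taxa, not a real tree


def iterate_divide_and_conquer(
    starting_tree: ToyTree,
    decompose: Callable[[ToyTree], List[Set[Taxon]]],
    estimate_subset_tree: Callable[[Set[Taxon]], ToyTree],
    merge_subtrees: Callable[[List[ToyTree]], ToyTree],
    iterations: int,
) -> ToyTree:
    """Run the generic loop: each pass starts from the previous pass's tree."""
    tree = starting_tree
    for _ in range(iterations):
        subsets = decompose(tree)                               # overlapping taxon subsets
        subtrees = [estimate_subset_tree(s) for s in subsets]   # one small tree per subset
        tree = merge_subtrees(subtrees)                         # tree on the full taxon set
    return tree


def overlapping_windows(order: Sequence[Taxon], size: int, overlap: int) -> List[Set[Taxon]]:
    """Toy decomposition (an assumption): sliding windows that share `overlap` taxa."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")
    step = size - overlap
    windows: List[Set[Taxon]] = []
    start = 0
    while True:
        windows.append(set(order[start:start + size]))
        if start + size >= len(order):
            break
        start += step
    return windows


# Toy run with placeholder components standing in for the real estimators.
taxa = [f"t{i}" for i in range(10)]
result = iterate_divide_and_conquer(
    starting_tree=tuple(taxa),
    decompose=lambda tree: overlapping_windows(tree, size=4, overlap=2),
    estimate_subset_tree=lambda subset: tuple(sorted(subset)),
    merge_subtrees=lambda trees: tuple(sorted(set().union(*trees))),
    iterations=3,
)
```

Passing the three stages in as callables keeps the loop independent of any particular aligner, tree estimator, or supertree method. The overlap between subsets is what gives the merge step shared taxa to reconcile the smaller trees on.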
