首页> 外文会议>Annual International Conference on Research in Computational Molecular Biology >Information Theoretic Approaches to Whole Genome Phylogenies
【24h】

Information Theoretic Approaches to Whole Genome Phylogenies

机译:全基因组的信息理论方法

获取原文

摘要

We describe a novel method for efficient reconstruction of phylogenetic trees, based on sequences of whole genomes or proteomes. The core of our method is a new measure of pairwise distances between sequences, whose lengths may greatly vary. This measure is based on information theoretic tools (Kullback-Leibler relative entropy). We present an algorithm for efficiently computing these distances. The algorithm uses suffix arrays to compute the distance of two £ long sequences in 0 time. It is fast enough to enable the construction of the phylogenomic tree for hundreds of species, and the phylogenomic forest for almost two thousand viruses. An initial analysis of the results exhibits a remarkable agreement with "acceptable phylogenetic truth". To assess our approach, it was implemented together with a number of alternative approaches, including two that were previously published in the literature. Comparing their outcome to ours, using a "traditional" tree and a standard tree comparison method, ouralgorithm improved upon the "competition" by a substantial margin.
机译:我们描述了一种基于全基因组或蛋白质素的序列有效地重建系统发育树的新方法。我们的方法的核心是序列之间的成对距离的新措施,其长度可能会大大变化。该措施基于信息理论工具(Kullback-Leibler相对熵)。我们提出了一种有效计算这些距离的算法。该算法使用后缀阵列在0次中计算两种长序列的距离。它足够快,可以实现数百种物种和系统核发生物林的系统核糖树的构建,近两千病毒。对结果的初步分析表现出与“可接受的系统发育真理”的显着协议。为了评估我们的方法,它与许多替代方法一起实施,其中包括先前在文献中发表的两种方法。将其结果与我们的结果进行比较,使用“传统”树和标准树比较方法,通过大幅保证金改善了“竞争”的“竞争”。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号