首页> 外文会议>International computing and combinatorics conference >Cophenetic Distances: A Near-Linear Time Algorithmic Framework
【24h】

Cophenetic Distances: A Near-Linear Time Algorithmic Framework

机译:Cophenetic距离:近线性时间算法框架

获取原文

摘要

Tree metrics that compare pairs of trees are an elementary tool for analyzing phylogenetic trees. The cophenetic distance is a classic vector-based tree metric introduced by Cardona et al. that originates from the pioneering work of Sokal and Rohlf more than 50 years ago. However, when faced with phylogenetic analyses where sets of large-scale trees are compared, the quadratic runtime of the current best-known (naieve) algorithm to compute the cophenetic distance becomes prohibitive. Here we describe an algorithmic framework that computes the cophenetic distance under the L_1-norm in O(n log~2 n) time, where n is the size of the compared pair of trees. Based on the work from Sokal and Rohlf, we introduce a natural class of cophenetic distances and show that our algorithmic framework can compute each member of this class in O(nlog~2n) time. In addition, we present a modification of this framework for computing these distances under the L_2-norm in O(n log n) time. Finally, we demonstrate the scalability of our algorithm.
机译:比较树木对的树木指标是分析系统树的基本工具。 cophenetic距离是Cardona等人提出的一种经典的基于向量的树度量。这源于50多年前索卡(Sokal)和罗尔夫(Rohlf)的开拓性工作。但是,当面对系统发育分析(其中比较大型树的集合)时,当前最著名的(天真)算法来计算行距的二次方运行时间变得令人望而却步。在这里,我们描述了一种算法框架,该算法在O(n log〜2 n)的时间内计算L_1范数下的行间距离,其中n是比较的树对的大小。在Sokal和Rohlf的工作基础上,我们引入了自然距离间隔,并证明我们的算法框架可以在O(nlog〜2n)时间内计算该距离的每个成员。另外,我们提出了该框架的一种修改形式,用于在O(n log n)时间的L_2范数下计算这些距离。最后,我们演示了算法的可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号