...
首页> 外文期刊>BMC Genomics >A generalized Robinson-Foulds distance for labeled trees
【24h】

A generalized Robinson-Foulds distance for labeled trees

机译:标记树木的广义罗宾逊 - FULDS距离

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Abstract Background The Robinson-Foulds (RF) distance is a well-established measure between phylogenetic trees. Despite a lack of biological justification, it has the advantages of being a proper metric and being computable in linear time. For phylogenetic applications involving genes, however, a crucial aspect of the trees ignored by the RF metric is the type of the branching event (e.g. speciation, duplication, transfer, etc). Results We extend RF to trees with labeled internal nodes by including a node flip operation, alongside edge contractions and extensions. We explore properties of this extended RF distance in the case of a binary labeling. In particular, we show that contrary to the unlabeled case, an optimal edit path may require contracting “good” edges, i.e. edges shared between the two trees. Conclusions We provide a 2-approximation algorithm which is shown to perform well empirically. Looking ahead, computing distances between labeled trees opens up a variety of new algorithmic directions.Implementation and simulations available at https://github.com/DessimozLab/pylabeledrf .
机译:摘要背景罗宾逊 - 福尔士(RF)距离是系统发育树质之间的良好衡量标准。尽管缺乏生物学理由,但它具有适当的指标并在线性时间可计算的优点。然而,对于涉及基因的系统发育应用,RF指标忽视的树木的关键方面是分支事件的类型(例如,样品,重复,转移等)。结果我们通过包括节点翻转操作,沿着边缘收缩和扩展将RF与标记的内部节点扩展到树木。在二进制标签的情况下,我们探讨了这种扩展RF距离的属性。特别是,我们表明与未标记的情况相反,最佳编辑路径可能需要收缩“良好”的边缘,即两棵树之间共享的边缘。结论我们提供了一种2近似算法,其显示良好的经验。展望未来,标记树之间的计算距离开辟了各种新的算法方向。https://github.com/dessimozlab/pylabeledrf提供的可用和模拟。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号