首页> 外文会议>Comparative genomics >TreeShrink: Efficient Detection of Outlier Tree Leaves
【24h】

TreeShrink: Efficient Detection of Outlier Tree Leaves

机译:TreeShrink:有效检测离群的树叶

获取原文
获取原文并翻译 | 示例

摘要

Phylogenetic trees include errors for a variety of reasons. We argue that one way to detect errors is to build a phylogeny with all the data then detect taxa that artificially inflate the tree diameter. We formulate an optimization problem that seeks to find k leaves that can be removed to reduce the tree diameter maximally. We present a polynomial time solution to this "k-shrink" problem. Given this solution, we then use non-parametric statistics to find an outlier set of taxa that have an unexpectedly high impact on the tree diameter. We test our method, TreeShrink, on five biological datasets, and show that it is more conservative than rogue taxon removal using RogueNaRok. When the amount of filtering is controlled, TreeShrink outperforms RogueNaRok in three out of the five datasets, and they tie in another dataset.
机译:系统发生树由于各种原因而包含错误。我们认为,一种检测错误的方法是使用所有数据构建系统发育树,然后检测人为增加树木直径的分类单元。我们制定了一个优化问题,试图找到k个可以去除以最大程度地减小树木直径的叶子。我们提出了这个“ k-收缩”问题的多项式时间解。给定该解决方案后,我们将使用非参数统计信息来找到对树径产生意外高影响的离群分类群。我们在五个生物学数据集上测试了TreeShrink方法,并表明它比使用RogueNaRok删除流氓分类群更为保守。在控制过滤量的情况下,TreeShrink在五个数据集中的三个数据集中优于RogueNaRok,并且在另一个数据集中并列。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号