On the Scalability of Computing Triplet and Quartet Distances

机译：关于计算三联和四重奏距离的可扩展性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we present an experimental evaluation of the algorithms by Brodal et al. [SODA 2013] for computing the triplet and quartet distance measures between two leaf labelled rooted and unrooted trees of arbitrary degree, respectively. The algorithms count the number of rooted tree topologies over sets of three leaves (triplets) and unrooted tree topologies over four leaves (quartets), respectively, that have different topologies in the two trees. The algorithms by Brodal et al. maintain a long sequence of variables (hundreds for quartets) for counting different cases to be considered by the algorithm, making it unclear if the algorithms would be of theoretical interest only. In our experimental evaluation of the algorithms the typical overhead per node is about 2 KB and 10 KB per node in the input trees for triplet and quartet computations, respectively. This allows us to compute the distance measures for trees with up to millions of nodes. The limiting factor is the amount of memory available. With 31 GB of memory all our input instances can be solved within a few minutes. In the algorithm by Brodal et al. a few choices were made, where alternative solutions possibly could improve the algorithm, in particular for quartet distance computations. For quartet computations we expand the algorithm to also consider alternative computations, and make two observations: First we observe that the running time can be improved from O(max(d_1, d_2)·n·lg n) to O(min(d_1, d_2)·n·lg n), where n is the number of leaves in the two trees, and d_1 and d_2 are the maximum degrees of the nodes in the two trees, respectively. Secondly, by taking a different approach to counting the number of disagreeing quartets we can reduce the number of calculations needed to calculate the quartet distance, improving both the running time and the space requirement by our algorithm by a constant factor.

机译：在本文中，我们通过Brodal等人提出了对算法的实验评价。 [SODA 2013]用于计算三重叶片之间的三重型和四重奏距离，分别标记为植根的叶片和任意程度的巨大树木。该算法分别计算三叶（三叶）和四叶（四重奏）上的三个叶子（三叶）和大型树拓扑结构的根拓粒子的数量，这两棵树中具有不同的拓扑。 Brodal等人的算法。保持长期变量（数百个四重奏），用于计算算法考虑不同的情况，如果算法仅是理论兴趣的情况下不清楚。在我们对算法的实验评估中，每个节点的典型开销分别为三联和四重奏计算的输入树中的每个节点大约2kb和10kb。这允许我们计算最多数百万节点的树木的距离测量。限制因素是可用的内存量。拥有31 GB的内存，所有的输入实例都可以在几分钟内解决。在Brodal等人的算法中。进行了几种选择，其中替代解决方案可能可以改善算法，特别是对于四重距离计算。对于四重奏计算，我们扩展算法还考虑替代计算，并进行两个观察结果：首先，我们观察到运行时间可以从O（最大（d_1，d_2）·n·lg n）到o（min（d_1， D_2）·n·lg n），其中n是两棵树中叶片的数量，d_1和d_2分别是两棵树中节点的最大程度。其次，通过采取不同的方法来计算不同意四重奏的数量，我们可以减少计算四重奏距离所需的计算次数，通过恒定因子改善我们的算法的运行时间和空间要求。

著录项

来源
《Workshop on Algorithm Engineering and Experiments》|2014年|165 p.|共11页
会议地点
作者
Morten Kragelund Holt; Jens Johansen; Gerth Stolting Brodal;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP301.6-53;
关键词

相似文献

外文文献
中文文献
专利

1. tqDist: a library for computing the quartet and triplet distances between binary or general trees [J] . Sand A, Holt MK, Johansen J, Bioinformatics . 2014,第14期

机译：tqDist：用于计算二叉树或普通树之间的四重奏和三重奏距离的库
2. Algorithms for Computing the Triplet and Quartet Distances for Binary General Trees [J] . Andreas Sand, Christian N. S. Pedersen, Gerth St#xF8, Biology . 2013,第4期

机译：二叉树的三重态和四重态距离的计算算法
3. A sub-cubic time algorithm for computing the quartet distance between two general trees [J] . Jesper Nielsen, Anders K Kristensen, Thomas Mailund, Algorithms for Molecular Biology . 2011,第1期

机译：一种亚三次时间算法，用于计算两棵普通树之间的四重奏距离
4. On the Scalability of Computing Triplet and Quartet Distances [C] . Morten Kragelund Holt, Jens Johansen, Gerth Stolting Brodal Workshop on Algorithm Engineering and Experiments . 2014

机译：关于计算三联和四重奏距离的可扩展性
5. Measuring acceptance of immigrant groups in the U.S.: The importance of the semantic differential scale in conjunction with the Social Distance Scale. [D] . Koleser, Jennifer M. 2009

机译：在美国衡量移民群体的接受程度：语义差异量表与“社会距离量表”结合使用的重要性。
6. Algorithms for Computing the Triplet Quartet Distances for Binary General Trees [O] . Andreas Sand, Morten K. Holt, Jens Johansen, 2013

机译：二叉树的三重态四重态距离计算算法
7. On the Scalability of Computing Triplet and Quartet Distances∗ [O] . Morten Kragelund, Holt Jens, Johansen Gerth, 2015

机译：关于计算三重和四重距离的可扩展性*
8. Properties of Quartet and Quintet Invariants and Triplet Seminvariants and Their Application in Direct Methods for X-Ray Structure Determination [R] . van der Putten, N. 1980

机译：四重和五重不变量和三重变体的性质及其在直接X射线结构测定方法中的应用

On the Scalability of Computing Triplet and Quartet Distances

摘要

著录项

相似文献

相关主题

期刊订阅