【24h】

Parallelizing the Phylogeny Problem

机译:并行系统发育问题

获取原文

摘要

The problem of determining the evolutionary history of species in the form of phylogenetic trees is known as the phylogeny problem. We present a parallelization of the character compatibility method for solving the phylogeny problem. Abstractly, the algorithm searches through all subsets of characters, which may be traits like opposable thumbs or DNA sequence values, looking for a maximal consistent subset. The notion of consistency in this case is the existence of a particular kind of phylogenetic tree called a perfect phylogeny tree. The two challenges to achieving an efficient implementation are load balancing and efficient sharing of information to enable pruning. In both cases, there is a trade-off between communication overhead and the quality of the solution. For load balancing we use a distributed task queue, which has imperfect load information but avoids centralization bottlenecks. For sharing pruning information, we use a distributed trie, which also avoids centralization but maintains incomplete information. We evaluate several implementations of the trie, the best of which achieves speedups of 50 on a 64-processor CM-5.
机译:确定系统发育树形式的物种进化史的问题被称为系统发育问题。我们提出了一种用于解决系统发育问题的字符兼容方法的并行化方法。抽象地,该算法搜索字符的所有子集,以寻找最大的一致性子集,这些子集可能是诸如对立拇指或DNA序列值之类的特征。在这种情况下,一致性的概念是存在一种称为完善系统发育树的系统发育树。实现有效实施的两个挑战是负载平衡和信息的有效共享以实现修剪。在这两种情况下,都需要在通信开销和解决方案质量之间进行权衡。对于负载平衡,我们使用分布式任务队列,该队列具有不完善的负载信息,但避免了集中化瓶颈。为了共享修剪信息,我们使用了分布式的特里,这也避免了集中化但维护了不完整的信息。我们评估了trie的几种实现,其中最好的实现是在64处理器CM-5上实现了50的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号