首页> 外文会议>IEEE Symposium on Computers and Communication >Towards a Faster and Accurate Supertree Inference
【24h】

Towards a Faster and Accurate Supertree Inference

机译:迈向更快,准确的超级推理

获取原文

摘要

Phylogenetic inference is one of the most challenging and important problems in computational biology. However, computing evolutionary links on data sets containing only few thousands of taxa easily becomes a daunting task. Moreover, recent advances in next-generation sequencing technologies are turning this problem even much harder, either in terms of complexity or scale. Therefore, phylogenetic inference requires new algorithms and methods to handle the unprecedented growth of biological data. In this paper, we identify several types of parallelism that are available while refining a supertree. We also present four improvements that we made to SuperFine - a state-of-the-art supertree (meta)method-, which add support: i) to use FastTree as the inference tool; ii) to use a parallel version of FastTree, or RAxML, as the inference tool; iii) to exploit intra-polytomy parallelism within the so-called polytomy refinement phase; and iv) to exploit, at the same time, inter-polytomy and intra-polytomy parallelism within the polytomy refinement phase. Together, these improvements allow an efficient and transparent exploitation of hybrid-polytomy parallelism. Additionally, we pinpoint how future contributions should enhance the performance of such applications. Our studies show groundbreaking results in terms of the achieved speedups, specially when using biological data sets. Moreover, we show that the new parallel strategy - which exploits the hybrid-polytomy parallelism within the polytomy refinement phase - exhibits good scalability, even in the presence of asymmetric sets of tasks. Furthermore, the achieved results show that the radical improvement in performance does not impair tree accuracy, which is a key issue in phylogenetic inferences.
机译:系统发育推论是计算生物学中最具挑战性和最重要的问题之一。然而,计算含有几千个分类达的数据集的进化进化链接容易成为令人生畏的任务。此外,下一代测序技术的最近进步甚至在复杂性或规模方面都更加困难。因此,系统发育推理需要新的算法和方法来处理生物数据的前所未有的生长。在本文中,我们确定了炼素时可用的几种并行性。我们还提出了四种改进,我们向超细作出了 - 一种最先进的超级(META)方法 - 添加支持:i)使用FastTree作为推理工具; ii)使用FastTree或RaxML的并行版本,作为推理工具; iii)在所谓的多元细化阶段内利用多种多心的并行性;和IV)在多种细化阶段同时,在多种性细胞间和多元细胞内并行性剥削。这些改进在一起允许对杂交 - 多元素平行度的有效和透明的开发。此外,我们还确定了未来的贡献应该如何增强这些应用程序的表现。我们的研究表明,在使用生物数据集时,在实现的加速方面表现出突破性的结果。此外,我们表明,即使在存在不对称的任务组的存在下,新的并联策略 - 利用多种细胞改进阶段内的混合多元素并行性 - 表现出良好的可扩展性。此外,达到的结果表明,性能的根本性改善不会损害树精度,这是系统发育推论的关键问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号