首页> 外文期刊>Systematic Biology >PhyloBayes MPI: Phylogenetic Reconstruction with Infinite Mixtures of Profiles in a ParallelEnvironment
【24h】

PhyloBayes MPI: Phylogenetic Reconstruction with Infinite Mixtures of Profiles in a ParallelEnvironment

机译:PhyloBayes MPI:在并行环境中配置文件无限混合的系统发生重建

获取原文
获取原文并翻译 | 示例
           

摘要

Modeling across site variation of the substitution process is increasingly recognized as important for obtaining more accurate phylogenetic reconstructions. Both finite and infinite mixture models have been proposed and have been shown to significantly improve on classical single-matrix models. Compared with their finite counterparts, infinite mixtures have a greater expressivity. However, they are computationally more challenging. This has resulted in practical compromises in the design of infinitemixture models. In particular, a fast but simplified version of a Dirichlet process model over equilibrium frequency profiles implemented in PhyloBayes has often been used in recent phylogenomics studies, while more refined model structures, more realistic and empirically more fit, have been practically out of reach. We introduce a message passing interface version of PhyloBayes, implementing the Dirichlet process mixture models as well as more classical empirical matrices and finite mixtures. The parallelization is made efficient thanks to the combination of two algorithmic strategies: a partial Gibbs sampling update of the tree topology and the use of a truncated stick-breaking representation for the Dirichlet process prior. The implementation showsclose to linear gains in computational speed for up to 64 cores, thus allowing faster phylogenetic reconstruction under complex mixture models. PhyloBayes MPI is freely available from our website www.phylobayes.org.
机译:越来越多地认识到替代过程中跨位点变异的建模对于获得更准确的系统发育重建至关重要。已经提出了有限和无限混合模型,并且已证明它们在经典单矩阵模型上有显着改进。与它们的有限对应物相比,无限混合具有更高的表现力。但是,它们在计算上更具挑战性。这导致了无限混合模型设计中的实际折衷。特别是,在最近的系统遗传学研究中,通常在PhyloBayes中使用在平衡频率分布图上实现的Dirichlet过程模型的快速但简化的版本,而实际上,更精确的模型结构,更现实的和从经验上更合适的模型已经无法实现。我们引入了PhyloBayes的消息传递接口版本,实现了Dirichlet过程混合模型以及更多经典的经验矩阵和有限混合。归功于两种算法策略的结合,并行处理变得高效了:树形拓扑的部分Gibbs采样更新以及先前的Dirichlet过程使用了截断的折断表示。该实现在多达64个核的计算速度上显示出接近线性的增长,因此可以在复杂的混合物模型下更快地进行系统发育重建。 PhyloBayes MPI可从我们的网站www.phylobayes.org免费获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号