首页> 外文期刊>SIAM Journal on Discrete Mathematics >FAST CONVERGENCE OF MARKOV CHAIN MONTE CARLO ALGORITHMS FOR PHYLOGENETIC RECONSTRUCTION WITH HOMOGENEOUS DATA ON CLOSELY RELATED SPECIES
【24h】

FAST CONVERGENCE OF MARKOV CHAIN MONTE CARLO ALGORITHMS FOR PHYLOGENETIC RECONSTRUCTION WITH HOMOGENEOUS DATA ON CLOSELY RELATED SPECIES

机译:亲缘关系密切的物种均质数据系统发育的马尔可夫链蒙特卡罗算法的快速收敛

获取原文
获取原文并翻译 | 示例
           

摘要

This paper studies a Markov chain for phylogenetic reconstruction which uses a popular' transition between tree topologies known as subtree pruning-and-regrafting (SPR). We analyze the Markov chain in the simpler setting where the generating tree consists of very short edge lengths, short enough so that each sample from the generating tree (or character in phylogenetic terminology) is likely to have only one mutation, and where there are enough samples so that the data looks like the generating distribution. We prove in this setting that the Markov chain is rapidly mixing, i.e., it quickly converges to its stationary distribution, which is the posterior distribution over tree topologies. Our proofs use that the leading term of the maximum likelihood function of a tree T is the maximum parsimony score, which is the size of the minimum cut in T needed to realize single edge cuts of the generating tree. Our main contribution is a combinatorial proof that, in our simplified setting, SPR moves are guaranteed to converge quickly to the maximum parsimony tree. Our results are in contrast to recent works showing examples with heterogeneous data (namely, the data is generated from a mixture distribution) where many natural Markov chains are exponentially slow to converge to the stationary distribution.
机译:本文研究了用于系统发育重建的马尔可夫链,该链使用了称为树修剪和移植(SPR)的树拓扑之间的流行过渡。我们在更简单的设置中分析马尔可夫链,其中生成树由非常短的边长组成,足够短,以使得生成树中的每个样本(或系统发育术语中的特征)很可能只有一个突变,并且有足够的突变样本,以便数据看起来像生成分布。我们在这种情况下证明了马尔可夫链正在快速混合,即它迅速收敛到其静态分布,即树形拓扑的后验分布。我们的证明使用树T的最大似然函数的前导项是最大简约分数,这是实现生成树的单边切割所需的T最小切割的大小。我们的主要贡献是组合证明,即在简化的设置下,SPR移动可确保快速收敛到最大简约树。我们的结果与最近的研究相反,后者的研究显示了具有异类数据的示例(即,数据是从混合分布生成的),在该数据中,许多自然马尔可夫链呈指数级缓慢收敛到平稳分布。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号