首页> 美国卫生研究院文献>Genome Biology and Evolution >Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses
【2h】

Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses

机译:从贝叶斯系统发生分析估计树木拓扑结构的有效样本量

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Bayesian phylogenetic analyses estimate posterior distributions of phylogenetic tree topologies and other parameters using Markov chain Monte Carlo (MCMC) methods. Before making inferences from these distributions, it is important to assess their adequacy. To this end, the effective sample size (ESS) estimates how many truly independent samples of a given parameter the output of the MCMC represents. The ESS of a parameter is frequently much lower than the number of samples taken from the MCMC because sequential samples from the chain can be non-independent due to autocorrelation. Typically, phylogeneticists use a rule of thumb that the ESS of all parameters should be greater than 200. However, we have no method to calculate an ESS of tree topology samples, despite the fact that the tree topology is often the parameter of primary interest and is almost always central to the estimation of other parameters. That is, we lack a method to determine whether we have adequately sampled one of the most important parameters in our analyses. In this study, we address this problem by developing methods to estimate the ESS for tree topologies. We combine these methods with two new diagnostic plots for assessing posterior samples of tree topologies, and compare their performance on simulated and empirical data sets. Combined, the methods we present provide new ways to assess the mixing and convergence of phylogenetic tree topologies in Bayesian MCMC analyses.
机译:贝叶斯系统发育分析使用马尔可夫链蒙特卡洛(MCMC)方法估计系统发育树拓扑和其他参数的后验分布。在从这些分布进行推断之前,重要的是评估它们的适当性。为此,有效样本量(ESS)估计MCMC输出代表给定参数的真正独立样本有多少。参数的ESS经常比从MCMC提取的样本数低得多,因为由于自相关,来自链的顺序样本可能是非独立的。通常,系统进化论者使用经验法则,即所有参数的ESS均应大于200。但是,尽管树形拓扑通常是最重要的参数,但我们没有方法来计算树形拓扑样本的ESS。几乎总是其他参数估计的中心。也就是说,我们缺乏一种方法来确定我们是否已对分析中最重要的参数之一进行了充分采样。在这项研究中,我们通过开发估计树形拓扑结构的ESS的方法来解决这个问题。我们将这些方法与两个新的诊断图结合起来,以评估树形拓扑的后验样本,并比较它们在模拟和经验数据集上的性能。结合起来,我们提出的方法提供了评估贝叶斯MCMC分析中系统树结构混合和收敛的新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号