首页> 外文期刊>Systematic Biology >The importance of data partitioning and the utility of bayes factors in Bayesian phylogenetics
【24h】

The importance of data partitioning and the utility of bayes factors in Bayesian phylogenetics

机译:数据分区的重要性和贝叶斯因子在贝叶斯系统发育学中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

As larger, more complex data sets are being used to infer phylogenies, accuracy of these phylogenies increasingly requires models of evolution that accommodate heterogeneity in the processes of molecular evolution. We investigated the effect of improper data partitioning on phylogenetic accuracy, as well as the type I error rate and sensitivity of Bayes factors, a commonly used method for choosing among different partitioning strategies in Bayesian analyses. We also used Bayes factors to test empirical data for the need to divide data in a manner that has no expected biological meaning. Posterior probability estimates are misleading when an incorrect partitioning strategy is assumed. The error was greatest when the assumed model was underpartitioned. These results suggest that model partitioning is important for large data sets. Bayes factors performed well, giving a 5% type I error rate, which is remarkably consistent with standard frequentist hypothesis tests. The sensitivity of Bayes factors was found to be quite high when the across-class model heterogeneity reflected that of empirical data. These results suggest that Bayes factors represent a robust method of choosing among partitioning strategies. Lastly, results of tests for the inclusion of unexpected divisions in empirical data mirrored the simulation results, although the outcome of such tests is highly dependent on accounting for rate variation among classes. We conclude by discussing other approaches for partitioning data, as well as other applications of Bayes factors.
机译:随着使用更大,更复杂的数据集来推断系统发育,这些系统发育的准确性越来越需要能够适应分子进化过程中异质性的进化模型。我们调查了不正确的数据分区对系统发育准确性的影响,以及贝叶斯分析中一种在不同分区策略中进行选择的常用方法贝叶斯因子的I型错误率和敏感性。我们还使用贝叶斯因子对经验数据进行测试,以了解是否需要以没有预期生物学意义的方式划分数据。当采用错误的分区策略时,后验概率估计会产生误导。当假设的模型未细分时,错误最大。这些结果表明,模型分区对于大型数据集很重要。贝叶斯因子表现良好,给出了5%的I型错误率,这与标准的频繁假设假设检验非常一致。当跨类模型异质性反映经验数据时,发现贝叶斯因子的敏感性非常高。这些结果表明,贝叶斯因子代表了一种在分区策略中进行选择的强大方法。最后,在经验数据中包含意外划分的测试结果反映了模拟结果,尽管此类测试的结果高度依赖于考虑类别之间的速率变化。最后,我们讨论了用于分区数据的其他方法以及贝叶斯因子的其他应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号