The importance of data partitioning and the utility of bayes factors in Bayesian phylogenetics

Brown JM; Lemmon AR

首页> 外文期刊>Systematic Biology >The importance of data partitioning and the utility of bayes factors in Bayesian phylogenetics

【24h】

The importance of data partitioning and the utility of bayes factors in Bayesian phylogenetics

机译：数据分区的重要性和贝叶斯因子在贝叶斯系统发育学中的应用

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

As larger, more complex data sets are being used to infer phylogenies, accuracy of these phylogenies increasingly requires models of evolution that accommodate heterogeneity in the processes of molecular evolution. We investigated the effect of improper data partitioning on phylogenetic accuracy, as well as the type I error rate and sensitivity of Bayes factors, a commonly used method for choosing among different partitioning strategies in Bayesian analyses. We also used Bayes factors to test empirical data for the need to divide data in a manner that has no expected biological meaning. Posterior probability estimates are misleading when an incorrect partitioning strategy is assumed. The error was greatest when the assumed model was underpartitioned. These results suggest that model partitioning is important for large data sets. Bayes factors performed well, giving a 5% type I error rate, which is remarkably consistent with standard frequentist hypothesis tests. The sensitivity of Bayes factors was found to be quite high when the across-class model heterogeneity reflected that of empirical data. These results suggest that Bayes factors represent a robust method of choosing among partitioning strategies. Lastly, results of tests for the inclusion of unexpected divisions in empirical data mirrored the simulation results, although the outcome of such tests is highly dependent on accounting for rate variation among classes. We conclude by discussing other approaches for partitioning data, as well as other applications of Bayes factors.

机译：随着使用更大，更复杂的数据集来推断系统发育，这些系统发育的准确性越来越需要能够适应分子进化过程中异质性的进化模型。我们调查了不正确的数据分区对系统发育准确性的影响，以及贝叶斯分析中一种在不同分区策略中进行选择的常用方法贝叶斯因子的I型错误率和敏感性。我们还使用贝叶斯因子对经验数据进行测试，以了解是否需要以没有预期生物学意义的方式划分数据。当采用错误的分区策略时，后验概率估计会产生误导。当假设的模型未细分时，错误最大。这些结果表明，模型分区对于大型数据集很重要。贝叶斯因子表现良好，给出了5％的I型错误率，这与标准的频繁假设假设检验非常一致。当跨类模型异质性反映经验数据时，发现贝叶斯因子的敏感性非常高。这些结果表明，贝叶斯因子代表了一种在分区策略中进行选择的强大方法。最后，在经验数据中包含意外划分的测试结果反映了模拟结果，尽管此类测试的结果高度依赖于考虑类别之间的速率变化。最后，我们讨论了用于分区数据的其他方法以及贝叶斯因子的其他应用。

著录项

来源
《Systematic Biology》 |2007年第4期|共13页
作者
Brown JM; Lemmon AR;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物科学;
关键词

相似文献

外文文献
中文文献
专利

1. The Importance of Data Partitioning and the Utility of Bayes Factors in Bayesian Phylogenetics [J] . Jeremy M. Brown and Alan R. Lemmon Systematic Biology . 2007,第4期

机译：贝叶斯系统发生学中数据划分的重要性和贝叶斯因子的效用
2. Phylogenetic Systematics and Biogeography of Hummingbirds: Bayesian and Maximum Likelihood Analyses of Partitioned Data and Selection of an Appropriate Partitioning Strategy [J] . Jimmy A. McGuire1 Christopher C. Witt2 Douglas L. Altshuler3 and J. V. Remsen4 Systematic Biology . 2007,第5期

机译：蜂鸟的系统发生学和生物地理学：分区数据的贝叶斯和最大似然分析以及适当分区策略的选择
3. Phylogenetic systematics and biogeography of hummingbirds: bayesian and maximum likelihood analyses of partitioned data and selection of an appropriate partitioning strategy [J] . McGuire JA, Witt CC, Altshuler DL, Systematic Biology . 2007,第5期

机译：蜂鸟的系统发生学和生物地理学：分区数据的贝叶斯和最大似然分析以及适当分区策略的选择
4. Bayes-Factor-VAE: Hierarchical Bayesian Deep Auto-Encoder Models for Factor Disentanglement [C] . Minyoung Kim, Yuting Wang, Pritish Sahu, International Conference on Computer Vision . 2019

机译：Bayes-Factor-VAE：用于因子分解的多层贝叶斯深度自动编码器模型
5. Large datasets and Trichoptera phylogenetics: DNA barcodes, partitioned phylogenetic models, and the evolution of Phryganeidae. [D] . Frandsen, Paul Bryan. 2015

机译：大型数据集和毛翅目系统发育：DNA条码，分区的系统发育模型以及，科的进化。
6. Model Averaging and Bayes Factor Calculation of Relaxed Molecular Clocks in Bayesian Phylogenetics [O] . Wai Lok Sibon Li, Alexei J. Drummond -1

机译：贝叶斯系统发生学中分子钟的模型平均和贝叶斯因子计算
7. Phylogenetic Systematics and Biogeography of Hummingbirds: Bayesian and Maximum Likelihood Analyses of Partitioned Data and Selection of an Appropriate Partitioning Strategy [O] . Jimmy A. McGuire, Christopher C. Witt, Douglas L. Altshuler, 2007

机译：蜂鸟的系统发育系统和生物地理：贝叶斯和分区数据的最大似然分析以及适当的分区策略的选择

The importance of data partitioning and the utility of bayes factors in Bayesian phylogenetics

摘要

著录项

相似文献

相关主题

期刊订阅