Journal: Evolutionary Biology

A Cautionary Note on Phylogenetic Signal Estimation from Imputed Databases

Abstract

Given the prevalence of missing data on species' traits (the Raunkiaeran shortfall), several methods have been proposed to fill sparse databases. However, analyses based on these imputed databases can introduce several biases. Here, we evaluated potential estimation biases caused by the use of imputed databases. In the evaluation, we considered the estimation of descriptive statistics, the regression coefficient, and phylogenetic signal under different missing-data and imputation scenarios. We found that the percentage of missing data, the missing-data mechanism, and the imputation method were all important in determining estimation errors. Imputation errors are not linearly related to estimation errors. Adding phylogenetic information provides better estimates of the evaluated statistics, but this information should be combined with other variables, such as traits correlated with the variable that has missing data. Using an empirical dataset, we found that even traits that are strongly correlated with each other, such as brain and body size in primates, can produce biases when phylogenetic signal is estimated from datasets with missing data. We advise researchers to share both their raw and imputed data, and to consider the pattern of missing data when evaluating which methods perform better for their goals. In addition, the performance of imputation methods should be judged mainly on statistical estimates rather than only on imputation error.
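
The distinction the abstract draws between imputation error and estimation error can be illustrated with a toy simulation. The sketch below is not the authors' simulation design: the two simulated traits, the parameter values, the choice of plain mean imputation, and the function names `slope` and `simulate` are all assumptions made for illustration, and it leaves out phylogenetic signal entirely because that would require a tree. It compares how far imputed values fall from the true ones (RMSE) with how far the resulting mean, standard deviation, and regression slope fall from their complete-data values, under two missing-data mechanisms.

```python
import random
import statistics


def slope(x, y):
    """Ordinary least-squares slope of y regressed on x."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx


def simulate(mechanism, frac_missing=0.4, n=2000, seed=1):
    """Delete part of trait y, mean-impute it, and report the resulting errors.

    All parameter values here are illustrative assumptions.
    """
    rng = random.Random(seed)
    # Two correlated hypothetical traits (for example, log body and log brain size).
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [0.8 * a + rng.gauss(0.0, 0.6) for a in x]

    n_missing = int(frac_missing * n)
    if mechanism == "MCAR":
        # Missing completely at random: every value is equally likely to be lost.
        missing = set(rng.sample(range(n), n_missing))
    elif mechanism == "MNAR":
        # Missing not at random: the largest y values are the ones lost.
        order = sorted(range(n), key=lambda i: y[i], reverse=True)
        missing = set(order[:n_missing])
    else:
        raise ValueError(f"unknown mechanism: {mechanism!r}")

    # Mean imputation: fill every gap with the mean of the observed values.
    fill = statistics.fmean(y[i] for i in range(n) if i not in missing)
    y_imp = [fill if i in missing else y[i] for i in range(n)]

    # Imputation error: how far the filled-in values are from the true ones.
    rmse = (sum((y_imp[i] - y[i]) ** 2 for i in missing) / n_missing) ** 0.5
    # Estimation errors: imputed-data statistic minus complete-data statistic.
    return {
        "imputation_rmse": rmse,
        "mean_error": statistics.fmean(y_imp) - statistics.fmean(y),
        "sd_error": statistics.pstdev(y_imp) - statistics.pstdev(y),
        "slope_error": slope(x, y_imp) - slope(x, y),
    }


if __name__ == "__main__":
    for mechanism in ("MCAR", "MNAR"):
        print(mechanism, simulate(mechanism))
```

In this toy setting, mean imputation under MCAR leaves the mean nearly unbiased while still shrinking the standard deviation and the slope, so a single imputation-error figure does not predict how each downstream estimate will behave. A method that uses a correlated trait or phylogenetic information would be expected to behave differently, which the sketch does not attempt to show.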