首页> 美国卫生研究院文献>other >A critical issue in model-based inference for studying trait-based community assembly and a solution
【2h】

A critical issue in model-based inference for studying trait-based community assembly and a solution

机译:基于模型的推理中研究基于特征的社区组装的关键问题和解决方案

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Statistical testing of trait-environment association from data is a challenge as there is no common unit of observation: the trait is observed on species, the environment on sites and the mediating abundance on species-site combinations. A number of correlation-based methods, such as the community weighted trait means method (CWM), the fourth-corner correlation method and the multivariate method RLQ, have been proposed to estimate such trait-environment associations. In these methods, valid statistical testing proceeds by performing two separate resampling tests, one site-based and the other species-based and by assessing significance by the largest of the two p-values (the pmax test). Recently, regression-based methods using generalized linear models (GLM) have been proposed as a promising alternative with statistical inference via site-based resampling. We investigated the performance of this new approach along with approaches that mimicked the pmax test using GLM instead of fourth-corner. By simulation using models with additional random variation in the species response to the environment, the site-based resampling tests using GLM are shown to have severely inflated type I error, of up to 90%, when the nominal level is set as 5%. In addition, predictive modelling of such data using site-based cross-validation very often identified trait-environment interactions that had no predictive value. The problem that we identify is not an “omitted variable bias” problem as it occurs even when the additional random variation is independent of the observed trait and environment data. Instead, it is a problem of ignoring a random effect. In the same simulations, the GLM-based pmax test controlled the type I error in all models proposed so far in this context, but still gave slightly inflated error in more complex models that included both missing (but important) traits and missing (but important) environmental variables. For screening the importance of single trait-environment combinations, the fourth-corner test is shown to give almost the same results as the GLM-based tests in far less computing time.
机译:由于没有共同的观察单位,因此对数据进行性状-环境关联性的统计测试是一个挑战:在物种,站点上的环境以及物种-站点组合中的介导丰度上都可以观察到性状。已经提出了许多基于相关的方法,例如社区加权特征均值方法(CWM),第四角相关方法和多元方法RLQ,以估计此类特征与环境的关联。在这些方法中,有效的统计检验是通过执行两个单独的重采样检验(一个基于地点的评估和另一种基于物种的检验)并通过评估两个p值中的最大值(pmax检验)来进行的。最近,已提出使用广义线性模型(GLM)的基于回归的方法,作为通过基于站点的重采样进行统计推断的有前途的替代方法。我们研究了这种新方法的性能以及模仿使用GLM而不是第四角的pmax测试的方法的性能。通过使用物种对环境响应具有其他随机变化的模型进行仿真,使用名义线性模型进行的基于现场的重采样测试表明,当标称水平设置为5%时,I型误差会严重膨胀,最高可达90%。此外,使用基于站点的交叉验证对此类数据进行预测建模通常会确定没有预测价值的性状-环境相互作用。我们确定的问题不是“遗漏变量偏差”问题,因为即使附加的随机变化与观察到的性状和环境数据无关,它也会发生。相反,这是忽略随机效应的问题。在相同的模拟中,基于GLM的pmax测试控制了迄今为止在此情况下提出的所有模型中的I型错误,但在更复杂的模型中仍会出现虚假的错误,这些模型既包含缺失(但很重要)特征又包含缺失(但很重要) )环境变量。为了筛选单个性状-环境组合的重要性,显示了第四角测试在短于计算时间的情况下提供了与基于GLM的测试几乎相同的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号