...
首页> 外文期刊>Ecological indicators >Sample size for the evaluation of presence-absence models
【24h】

Sample size for the evaluation of presence-absence models

机译:评估存在的模型的样本量

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The effect of the training dataset sample size has been shown to have profound outcomes on the performance of species distribution models. However, the effects that the testing dataset sample size can have on the assessment of a models predictive capacity has received little attention. In this study, I used simulations to study how accurate two discrimination statics, the AUC (the area under the receiver operating characteristic - ROC - curve) and Se* (the probability of correctly classifying any case and calculated from the threshold that makes minimum the difference between sensitivity and specificity), are estimated based on sample size. ROC curves with known discrimination ability were simulated, samples were randomly taken, the two discrimination statistics were estimated, and the differences between the two estimators and their respective true values were computed to understand how bias and precision were affected by sample size. In general, as sample size increases, the difference between reported and true discrimination capacity decreased. There were no important differences between the estimated AUC and Se* statistics in terms of bias and precision. Under realistic scenarios where the ROC points are not necessarily part of the true underlying ROC curve, the two discrimination statistics are both unbiased and equally precise, and the higher the true discrimination capacity is, the more accurate they are estimated. Between 20 and 30 is a lowest sample size limit since below this interval accuracy estimates considerably decreases. All together, these results are very important since many interesting SDM applications involve rare and poorly known species for which sample sizes are unavoidably small.
机译:训练数据集样本大小的效果已被证明对物种分布模型的性能进行了深远的结果。然而,测试数据集样本大小可能对模型预测容量进行评估的影响已经很少受到关注。在本研究中,我使用模拟来研究两个歧视估计的准确性,AUC(接收器运行特征 - Roc - 曲线下的区域)和SE *(正确分类任何案例的可能性并从最低阈值计算敏感性和特异性之间的差异)基于样本大小估计。模拟具有已知辨别能力的ROC曲线,随机采集样品,估计了两个辨别统计数据,两种估算器与其各自的真实值之间的差异是为了了解偏差和精度如何受到样本大小的影响。通常,随着样本大小的增加,报告和真正的歧视容量之间的差异减少。估计的AUC和SE *统计数据在偏差和精度方面没有重要差异。在ROC点不一定是真正的税率曲线的一部分的现实情景下,两种歧视统计数据既不偏见,同样准确,真正的歧视能力越高,估计越准确。在20和30之间是最低的样本量限制,因为低于该间隔精度估计显着降低。总之,这些结果非常重要,因为许多有趣的SDM应用涉及罕见的和众所周知的样本尺寸不可避免地小的物种。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号