Sample size for the evaluation of presence-absence models

Jimenez-Valverde Alberto

首页> 外文期刊>Ecological indicators >Sample size for the evaluation of presence-absence models

【24h】

Sample size for the evaluation of presence-absence models

机译：评估存在的模型的样本量

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The effect of the training dataset sample size has been shown to have profound outcomes on the performance of species distribution models. However, the effects that the testing dataset sample size can have on the assessment of a models predictive capacity has received little attention. In this study, I used simulations to study how accurate two discrimination statics, the AUC (the area under the receiver operating characteristic - ROC - curve) and Se* (the probability of correctly classifying any case and calculated from the threshold that makes minimum the difference between sensitivity and specificity), are estimated based on sample size. ROC curves with known discrimination ability were simulated, samples were randomly taken, the two discrimination statistics were estimated, and the differences between the two estimators and their respective true values were computed to understand how bias and precision were affected by sample size. In general, as sample size increases, the difference between reported and true discrimination capacity decreased. There were no important differences between the estimated AUC and Se* statistics in terms of bias and precision. Under realistic scenarios where the ROC points are not necessarily part of the true underlying ROC curve, the two discrimination statistics are both unbiased and equally precise, and the higher the true discrimination capacity is, the more accurate they are estimated. Between 20 and 30 is a lowest sample size limit since below this interval accuracy estimates considerably decreases. All together, these results are very important since many interesting SDM applications involve rare and poorly known species for which sample sizes are unavoidably small.

机译：训练数据集样本大小的效果已被证明对物种分布模型的性能进行了深远的结果。然而，测试数据集样本大小可能对模型预测容量进行评估的影响已经很少受到关注。在本研究中，我使用模拟来研究两个歧视估计的准确性，AUC（接收器运行特征 - Roc - 曲线下的区域）和SE *（正确分类任何案例的可能性并从最低阈值计算敏感性和特异性之间的差异）基于样本大小估计。模拟具有已知辨别能力的ROC曲线，随机采集样品，估计了两个辨别统计数据，两种估算器与其各自的真实值之间的差异是为了了解偏差和精度如何受到样本大小的影响。通常，随着样本大小的增加，报告和真正的歧视容量之间的差异减少。估计的AUC和SE *统计数据在偏差和精度方面没有重要差异。在ROC点不一定是真正的税率曲线的一部分的现实情景下，两种歧视统计数据既不偏见，同样准确，真正的歧视能力越高，估计越准确。在20和30之间是最低的样本量限制，因为低于该间隔精度估计显着降低。总之，这些结果非常重要，因为许多有趣的SDM应用涉及罕见的和众所周知的样本尺寸不可避免地小的物种。

著录项

来源
《Ecological indicators》 |2020年第7期|106289.1-106289.7|共7页
作者
Jimenez-Valverde Alberto;
展开▼
作者单位

Univ Alcala De Henares Fac Ciencias Dept Ciencias Vida Campus Univ Crta A-2 Km 33-6 Madrid 28805 Spain;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
AUC; Accuracy; ROC curve; Sample size; Sensitivity; Specificity;

机译：AUC;准确性;ROC曲线;样品大小;敏感性;特异性;

相似文献

外文文献
中文文献
专利

1. Analysis of distribution patterns of yellow European eels in the Loire catchment using logistic models based on presence-absence of different size-classes [J] . Lasne E., Laffaille P. Ecology of Freshwater Fish . 2008,第1期

机译：基于不同大小等级存在与否的逻辑模型分析卢瓦尔河集水区欧洲黄鳗的分布模式
2. Evaluation and selection of models for out-of-sample prediction when the sample size is small relative to the complexity of the data-generating process [J] . Hannes Leeb Bernoulli: official journal of the Bernoulli Society for Mathematical Statistics and Probability . 2008,第3期

机译：当样本量相对于数据生成过程的复杂性较小时，评估和选择模型以进行样本外预测
3. Evaluation of non-linear-mixed-effect modeling to reduce the sample sizes of pediatric trials in type 2 diabetes mellitus [J] . Clémence Rigaux, Bernard Sébastien Journal of pharmacokinetics and pharmacodynamics . 2020,第1期

机译：非线性混合效应建模的评价，以减少2型糖尿病儿科试验的样本尺寸
4. Comparative Evaluation of Asymmetric Price Transmission Linear Models Using rMDL, eMDL, nMDL, gMDL, AIC and BIC Across Varying Sample Sizes [C] . Irene Kafui Vorsah Amponsah, Henry De-Graft Acquah, Nathaniel Kwamena Howard 2019 International Conference on Computing, Computational Modelling and Applications . 2019

机译：使用不同样本量的rMDL，eMDL，nMDL，gMDL，AIC和BIC的非对称价格传递线性模型的比较评估
5. Adequate sample sizes for viable 2-level hierarchical linear modeling analysis: A study on sample size requirement in HLM in relation to different intraclass correlations. [D] . Shih, Tse-Hua. 2008

机译：可行的两级分层线性建模分析所需的足够样本量：HLM中与不同类内相关性相关的样本量需求研究。
6. A Post-Hoc Evaluation of a Sample Size Re-estimation in the SPS3 Study Running head: Evaluating the SPS3 Sample Size Re-Estimation [O] . Leslie A McClure, Jeff M Szychowski, Oscar Benavente, -1

机译：在SPS3研究运行头中对样本大小重新估计进行事后评估：评估SPS3样本大小重新估计
7. Field Evaluation of a Presence-Absence, Sequential Sampling Plan for Pink Bollworm Eggs [O] . Hutchinson Bill, Stroschein Debra, Beasley Bud, 1988

机译：粉色棉铃虫卵有无的连续抽样计划的现场评估

Sample size for the evaluation of presence-absence models

摘要

著录项

相似文献

相关主题

期刊订阅