首页> 外文期刊>Aquatic Ecology >Exploration of sample size and diatom-based indicator performance in three North American phosphorus training sets
【24h】

Exploration of sample size and diatom-based indicator performance in three North American phosphorus training sets

机译:探索三个北美磷训练集的样本量和基于硅藻的指标表现

获取原文
获取原文并翻译 | 示例
           

摘要

Three large training sets were investigated to determine optimal sample sizes for diatom-based inference models. The sample sets represented (1) assemblages from Great Lakes coastlines, (2) phytoplankton from the pelagic Great Lakes and (3) surface sediment assemblages from Minnesota lakes. Diatom-based weighted average models to infer nutrient concentrations were developed for each training set. Training set sample sizes ranging from 10 to the maximum number of samples were created through random sample selection, and performance of each model was evaluated. For each model iteration, diatom-inferred (DI) nutrient data were related to stressor data (e.g., adjacent agricultural or urban development) to characterize the ability of each model to track human activities. The relationships between model performance parameters (DI-stressor correlations and model r (2), error and bias) and sample size were used to determine the minimum sample size needed to optimize models for each region. Depending on the training set, at least 40-70 samples were needed to capture the variation in diatom assemblages and environmental conditions to such a degree that non-analog situations should be rare and so should provide an unambiguous result if the model was applied to any sample assemblage from the region. It is recommended that one exercises caution when dealing with smaller training sets unless there is certainty that the selected samples reflect the regional variability in diatom assemblages and environmental conditions.
机译:研究了三个大型训练集,以确定基于硅藻的推理模型的最佳样本量。样品集代表(1)大湖沿岸的集合体,(2)浮游大湖的浮游植物和(3)明尼苏达州湖的表层沉积物群。为每个训练集开发了基于硅藻的加权平均模型来推断营养物浓度。通过随机抽样选择,创建了从10到最大样本数量的训练集样本大小,并评估了每个模型的性能。对于每个模型迭代,将硅藻推断(DI)的营养数据与压力源数据(例如相邻的农业或城市发展)相关联,以表征每个模型跟踪人类活动的能力。模型性能参数(DI应力因子相关性和模型r(2),误差和偏差)与样本量之间的关系用于确定优化每个区域模型所需的最小样本量。根据训练集的不同,至少需要40-70个样本才能捕获硅藻集合体和环境条件的变化,以至于非模拟情况很少见,因此,如果将模型应用于任何情况,则应提供明确的结果。该区域的样本集合。建议在处理较小的训练集时要格外小心,除非可以确定所选的样本反映出硅藻集合体和环境条件的区域差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号