...
首页> 外文期刊>Geoderma: An International Journal of Soil Science >Evaluation of conditioned Latin hypercube sampling for soil mapping based on a machine learning method
【24h】

Evaluation of conditioned Latin hypercube sampling for soil mapping based on a machine learning method

机译:基于机器学习方法的土壤映射调节拉丁超立体采样评价

获取原文
获取原文并翻译 | 示例
           

摘要

Sampling design plays an important role in soil survey and soil mapping. Conditioned Latin hypercube sampling (cLHS) has been proven as an efficient sampling strategy and used widely in digital soil mapping. cLHS samples are randomly selected in each stratum of environmental variables, thus the produced sample sets can vary significantly at different runs with the same sample size. Although variation of mapping accuracies caused by the randomness of cLHS has been realized and qualitatively mentioned in past studies. However, how the randomness of cLHS could quantitatively influence mapping accuracy has rarely been examined. In this study, we conducted experiments to examine how the sample randomness quantitatively influence soil mapping accuracy with different sample sizes, and analyzed the possible reasons from a pedogenesis perspective. The results showed that the largest range of mapping accuracies of 500 repeats was 39.5% at a sample density of 2.59 point/ km(2), while the smallest range was 7.3% at the maximum sample size with a sample density of 32.47 point/km(2). The sample density for satisfactory prediction accuracies in our study area was at least 10.06 Point/km(2). The results showed that both the allocation of sample points to each soil series and the typicality of sample points played important roles in mapping accuracies. But the deep reasons causing the unstable performance of cLHS at small sample sizes were the imbalanced class distribution of soil series and the overlap between soil series in the distribution of environmental covariates. Researchers need to be cautious about the output when applying cLHS with small sampling densities. Some effective approaches to address this issue include increasing the sample size, checking the sample allocations of a cLHS design with the assistance of legacy soil maps, or adding the legacy soil map as a variable during sampling design. When the sampling resources and legacy soil maps are limited for an area, fuzzy k-means clustering sampling could be a potential alternative. This study provides useful references for better understanding the uncertainty of cLHS when the sample density is small and selecting alternative sampling methods accordingly.
机译:抽样设计在土壤调查和土壤映射中起着重要作用。有条件的拉丁杂交抽样(CLHS)被证明是一种有效的采样策略,并广泛用于数字土壤映射。在每个环境变量中随机选择ClHS样品,因此所产生的样品集可以在不同的样本大小的不同运行时显着变化。尽管在过去的研究中已经实现和定性提到了由CLHS随机性引起的映射精度的变化。然而,ClHS的随机性如何量化可以估量地影响映射精度。在这项研究中,我们进行了实验,以检查样品随机性如何定量地影响土壤映射精度与不同的样本尺寸,并分析了从施用角度的可能原因。结果表明,500重复的最大映射精度范围为39.5%,样品密度为2.59点/ km(2),而最小范围为7.3%,最大样本尺寸为32.47点/ km (2)。我们研究区域中令人满意的预测精度的样本密度至少为10.06点/ km(2)。结果表明,样本点对每个土壤系列的分配和样本点的典型性在映射精度下起着重要作用。但是,导致小型样本大小不稳定性能的深刻原因是土壤系列的不平衡阶级分布和环境协变量分布的土壤系列之间的重叠。在应用小型采样密度时,研究人员需要对输出持谨慎态度。解决此问题的一些有效方法包括增加样本大小,在传统土壤图的帮助下检查CLHS设计的样本分配,或者在采样设计期间将传统土壤图作为可变的遗留土壤图。当采样资源和遗留土壤图限制为一个区域时,模糊K-Means聚类采样可能是潜在的替代方案。本研究提供了有用的参考,以便更好地理解当样品密度小并相应地选择替代采样方法时的ClHS的不确定性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号