首页> 外文会议>EPIA Conference on Artificial Intelligence >Soil Classification Based on Physical and Chemical Properties Using Random Forests
【24h】

Soil Classification Based on Physical and Chemical Properties Using Random Forests

机译:基于理化性质的随机森林土壤分类

获取原文

摘要

Soil classification is a method of encoding the most relevant information about a given soil, namely its composition and characteristics, in a single class, to be used in areas like agriculture and forestry. In this paper, we evaluate how confidently we can predict soil classes, following the World Reference Base classification system, based on the physical and chemical characteristics of its layers. The Random Forests classifier was used with data consisting of 6 760 soil profiles composed by 19 464 horizons, collected in Mexico. Four methods of modelling the data were tested (i.e., standard depths, n first layers, thickness, and area weighted thickness). We also fine-tuned the best parameters for the classifier and for a k-NN imputation algorithm, used for addressing problems of missing data. Under-represented classes showed significantly worse results, by being repeatedly predicted as one of the majority classes. The best method to model the data was found to be the n first layers approach, with missing values being imputed with k-NN (k = 1). The results present a Kappa value from 0.36 to 0.48 and were in line with the state of the art methods, which mostly use remote sensing data.
机译:土壤分类是一种编码有关给定土壤的最相关信息的方法,即将其组成和特征归为一类,用于农业和林业等领域。在本文中,我们根据世界参考基准分类系统,基于其各层的物理和化学特征,评估了我们对土壤分类的信心程度。随机森林分类器用于包含在墨西哥收集的由19 464个地平线组成的6 760个土壤剖面的数据。测试了四种对数据建模的方法(即标准深度,n个第一层,厚度和面积加权厚度)。我们还微调了分类器和k-NN插补算法的最佳参数,这些参数用于解决丢失数据的问题。代表性不足的类别由于被反复预测为多数类别之一而显示出明显较差的结果。发现数据建模的最佳方法是第n层方法,用k-NN(k = 1)估算缺失值。结果显示Kappa值介于0.36到0.48之间,符合大多数使用遥感数据的最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号