...
首页> 外文期刊>IEEE Geoscience and Remote Sensing Letters >On the Effect of Spatially Non-Disjoint Training and Test Samples on Estimated Model Generalization Capabilities in Supervised Classification With Spatial Features
【24h】

On the Effect of Spatially Non-Disjoint Training and Test Samples on Estimated Model Generalization Capabilities in Supervised Classification With Spatial Features

机译:基于空间特征的监督分类中空间不相交训练样本和测试样本对模型泛化能力估计的影响

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In this letter, we establish two sampling schemes to select training and test sets for supervised classification. We do this in order to investigate whether estimated generalization capabilities of learned models can be positively biased from the use of spatial features. Numerous spatial features impose homogeneity constraints on the image data, whereby a spatially connected set of image elements is attributed identical feature values. In addition to a frequent occurrence of intrinsic spatial autocorrelation, this leads to extrinsic spatial autocorrelation with respect to the image data. The first sampling scheme follows a spatially random partitioning into training and test sets. In contrast to that, the second strategy implements a spatially disjoint partitioning, which considers in particular topological constraints that arise from the deployment of spatial features. Experimental results are obtained from multi- and hyperspectral acquisitions over urban environments. They underline that a large share of the differences between estimated generalization capabilities obtained with the spatially disjoint and non-disjoint sampling strategies can be attributed to the use of spatial features, whereby differences increase with an increasing size of the spatial neighborhood considered for computing a spatial feature. This stresses the necessity of a proper spatial sampling scheme for model evaluation to avoid overoptimistic model assessments.
机译:在这封信中,我们建立了两个抽样方案来选择用于监督分类的训练和测试集。我们这样做是为了调查学习模型的估计泛化能力是否可以因使用空间特征而出现正偏。许多空间特征在图像数据上施加了同质性约束,从而一组空间相连的图像元素被赋予相同的特征值。除了经常发生内在空间自相关之外,这还导致相对于图像数据的外部空间自相关。第一种采样方案是将空间随机划分为训练集和测试集。与此相反,第二种策略实现了空间上不相交的分区,该分区特别考虑了由空间特征的部署引起的拓扑约束。从城市环境中的多光谱和高光谱采集获得实验结果。他们强调,通过空间不相交和非不相交采样策略获得的估计泛化能力之间的差异中,很大一部分可以归因于空间特征的使用,由此,差异随着计算空间的空间邻域大小的增加而增加。特征。这强调了用于模型评估的适当空间采样方案的必要性,以避免过分乐观的模型评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号