On the Effect of Spatially Non-Disjoint Training and Test Samples on Estimated Model Generalization Capabilities in Supervised Classification With Spatial Features

Christian Geiß; Patrick Aravena Pelizari; Henrik Schrade; Alexander Brenning; Hannes Taubenböck

首页> 外文期刊>IEEE Geoscience and Remote Sensing Letters >On the Effect of Spatially Non-Disjoint Training and Test Samples on Estimated Model Generalization Capabilities in Supervised Classification With Spatial Features

【24h】

On the Effect of Spatially Non-Disjoint Training and Test Samples on Estimated Model Generalization Capabilities in Supervised Classification With Spatial Features

机译：基于空间特征的监督分类中空间不相交训练样本和测试样本对模型泛化能力估计的影响

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this letter, we establish two sampling schemes to select training and test sets for supervised classification. We do this in order to investigate whether estimated generalization capabilities of learned models can be positively biased from the use of spatial features. Numerous spatial features impose homogeneity constraints on the image data, whereby a spatially connected set of image elements is attributed identical feature values. In addition to a frequent occurrence of intrinsic spatial autocorrelation, this leads to extrinsic spatial autocorrelation with respect to the image data. The first sampling scheme follows a spatially random partitioning into training and test sets. In contrast to that, the second strategy implements a spatially disjoint partitioning, which considers in particular topological constraints that arise from the deployment of spatial features. Experimental results are obtained from multi- and hyperspectral acquisitions over urban environments. They underline that a large share of the differences between estimated generalization capabilities obtained with the spatially disjoint and non-disjoint sampling strategies can be attributed to the use of spatial features, whereby differences increase with an increasing size of the spatial neighborhood considered for computing a spatial feature. This stresses the necessity of a proper spatial sampling scheme for model evaluation to avoid overoptimistic model assessments.

机译：在这封信中，我们建立了两个抽样方案来选择用于监督分类的训练和测试集。我们这样做是为了调查学习模型的估计泛化能力是否可以因使用空间特征而出现正偏。许多空间特征在图像数据上施加了同质性约束，从而一组空间相连的图像元素被赋予相同的特征值。除了经常发生内在空间自相关之外，这还导致相对于图像数据的外部空间自相关。第一种采样方案是将空间随机划分为训练集和测试集。与此相反，第二种策略实现了空间上不相交的分区，该分区特别考虑了由空间特征的部署引起的拓扑约束。从城市环境中的多光谱和高光谱采集获得实验结果。他们强调，通过空间不相交和非不相交采样策略获得的估计泛化能力之间的差异中，很大一部分可以归因于空间特征的使用，由此，差异随着计算空间的空间邻域大小的增加而增加。特征。这强调了用于模型评估的适当空间采样方案的必要性，以避免过分乐观的模型评估。

著录项

来源
《IEEE Geoscience and Remote Sensing Letters》 |2017年第11期|2008-2012|共5页
作者
Christian Geiß; Patrick Aravena Pelizari; Henrik Schrade; Alexander Brenning; Hannes Taubenböck;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Training; Correlation; Spatial resolution; Radio frequency; Computational modeling; Hyperspectral imaging;

机译：训练;相关性;空间分辨率;射频;计算模型;高光谱成像;

相似文献

外文文献
中文文献
专利

1. A Novel Approach to the Selection of Spatially Invariant Features for the Classification of Hyperspectral Images With Improved Generalization Capability [J] . Bruzzone L., Persello C. Geoscience and Remote Sensing, IEEE Transactions on . 2009,第9期

机译：具有改进泛化能力的高光谱图像分类空间不变特征选择的新方法
2. Estimating spatial models with endogenous variables, a spatial lag and spatially dependent disturbances: Finite sample properties [J] . Bernard Fingleton, Julie Le Gallo Papers in regional science . 2008,第3期

机译：估计具有内生变量，空间滞后和空间相关干扰的空间模型：有限样本属性
3. Effects of spatial sampling density and spatial extent on linear land use regression modelling of NO_2 estimates in an automobile-oriented city [J] . Maddix Melanie, Adams Matthew D. Atmospheric environment . 2020,第Octa期

机译：空间采样密度和空间程度对汽车导向城市NO_2估计线性土地利用回归建模的影响
4. Beyond Spatial Pyramids: A New Feature Extraction Framework with Dense Spatial Sampling for Image Classification [C] . Shengye Yan, Xinxing Xu, Dong Xu, European conference on computer vision . 2012

机译：超越空间金字塔：具有密集空间采样的新特征提取框架用于图像分类
5. Find, Inform, and Test (FIT): A Spatial Modeling Framework to Estimate Contributions of Spatially Distributed Sources to Microbial Contaminants in the Environment [D] . Wiesner-Friedman, Corinne E. 2021

机译：查找，通知和测试（适合）：空间建模框架，以估算空间分布源对环境中微生物污染物的贡献
6. On Splitting Training and Validation Set: A Comparative Study of Cross-Validation Bootstrap and Systematic Sampling for Estimating the Generalization Performance of Supervised Learning [O] . Yun Xu, Royston Goodacre -1

机译：关于拆分训练和验证集：交叉验证自举和系统抽样的比较研究用于估计监督学习的泛化性能
7. On the Effect of Spatially Non-Disjoint Training and Test Samples on Estimated Model Generalization Capabilities in Supervised Classification With Spatial Features [O] . Christian Geib, Patrick Aravena Pelizari, Henrik Schrade, 2017

机译：关于空间非脱节训练和测试样本对空间特征监督分类估计模型概括能力的影响
8. Terrain and Spatial Effects on a Hazard Prediction and Assessment Capability (HPAC) Software Dose-Rate Contour Plot Predictions as Compared to a Sample of Local Fallout Data from Test Detonations in the Continental United States, 1945-1962 [R] . Pace, K. D. 2006

机译：与1945 - 1962年美国大陆测试爆轰的局部辐射数据样本相比，危险预测和评估能力（HpaC）软件剂量率等值线图预测的地形和空间效应

On the Effect of Spatially Non-Disjoint Training and Test Samples on Estimated Model Generalization Capabilities in Supervised Classification With Spatial Features

摘要

著录项

相似文献

相关主题

期刊订阅