首页> 外文会议>Signal Processing and Communications Applications Conference >Comparing the classification performances of supervised classifiers with balanced and imbalanced SAR data sets
【24h】

Comparing the classification performances of supervised classifiers with balanced and imbalanced SAR data sets

机译:将监督分类器与平衡和不平衡SAR数据集的分类性能进行比较

获取原文

摘要

In this study, the classification accuracies of four different classification methods with two balanced and two imbalanced data sets for the classification of Sentinel-1B SAR (Synthetic Aperture Radar) data were comparatively evaluated and the impacts of training data sets into the accuracy were investigated. In some circumstances, it is possible to collect high number of ground truth samples for some classes however not possible for some other classes which are represented by less number of ground truth samples. In such cases, the imbalanced data set is the issue. Supervised classifiers, by its nature, employ many different input parameters in consideration of the decision surface separating the two classes. More than the classification model itself, purity, size and allocation of ground truth samples as well as the adaptation between the training data and adopted classifier are of key importance in accuracy of image classification. In our study, two parametric (Naïve Bayes and Linear Discriminant Analysis) and two non-parametric (Support Vector Machines and Random Forests) supervised classification methods were implemented. Our experimental results demonstrated that there were not any significant change in classification accuracies of parametric classifiers and support vector machines however an increase in classification accuracy of random forest with imbalanced dataset. Furthermore, highest classification accuracy of this study (89.94%) was obtained by Support Vector Machines classification.
机译:在这项研究中,比较地评估了四种不同分类方法的分类精度,该方法具有两个平衡数据集和两个不平衡数据集,用于对Sentinel-1B SAR(合成孔径雷达)数据进行分类,并研究了训练数据集对准确性的影响。在某些情况下,可以为某些类别收集大量的地面事实样本,但是对于由较少数量的地面真实样本表示的其他某些类别则无法收集。在这种情况下,问题就在于数据集不平衡。考虑到将两个类别分开的决策面,监督分类器本质上采用许多不同的输入参数。除了分类模型本身之外,地面真值样本的纯度,大小和分配以及训练数据与采用的分类器之间的匹配对于图像分类的准确性至关重要。在我们的研究中,实施了两种参数(朴素贝叶斯和线性判别分析)和两种非参数(支持向量机和随机森林)监督分类方法。我们的实验结果表明,参数分类器和支持向量机的分类准确度没有任何显着变化,但是数据集不平衡的随机森林的分类准确度有所提高。此外,通过支持向量机分类获得了本研究的最高分类精度(89.94 \%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号