首页> 外文会议>International symposium on knowledge and systems sciences >An Iterative Multi-Strategy Approach to Classification
【24h】

An Iterative Multi-Strategy Approach to Classification

机译:分类的迭代多策略方法

获取原文

摘要

In certain knowledge discovery tasks that involve classification, the number of classes may be unknown ahead of time and it may vary depending on the application context. For example, given a remotely sensed imagery dataset, the number of land cover types is not known ahead of time and it may vary with different analytic objectives (e. g., treating forest as one class or two classes by differentiating the deciduous from the coniferous). Furthermore, different classification methods have different strengths and weaknesses so that each method works well only with data of certain classes. It is desirable to combine multiple classifiers to best utilize their strengths with a given classification task. This paper presents an iterative methodology that combines clustering, classification, and domain knowledge to obtain enhanced classification results. It first uses clustering techniques and clustering evaluation metrics to determine the number of clusters in the data. The metrics include sum of squared errors, a skewness measure, and a separationcohesion index. Then it iteratively trains several classifiers and uses their predictions to obtain optimal classification results. At each iteration, the classes predicted by the most accurate classifier are kept if the accuracy exceeds the required threshold and training datasets for the remaining classes are obtained by incorporating domain knowledge. The use of the methodology is demonstrated using two satellite imagery datasets.
机译:在涉及分类的某些知识发现任务中,类的数量可能提前未知,并且可能因应用上下文而变化。例如,给定远程感测的图像数据集,陆地覆盖类型的数量在时间之前不知道,并且可以随着不同的分析目标而变化(例如,通过将落叶与针叶的落叶区分,将森林视为一类或两类。此外,不同的分类方法具有不同的优点和缺点,因此每种方法仅适用于某些类的数据。希望将多个分类器组合以用给定分类任务的最佳利用它们的优点。本文提出了一种迭代方法,该方法结合了聚类,分类和域知识,以获得增强的分类结果。它首先使用群集技术和聚类评估度量来确定数据中的群集数。度量包括平方误差,偏斜度量和分离指数的总和。然后它迭代地列举了几个分类器并使用他们的预测来获得最佳分类结果。在每个迭代时,如果精度超过所需的阈值和剩余类别的训练数据集,则通过结合域知识来保存由最准确的分类器预测的类。使用两个卫星图像数据集来证明方法的使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号