首页> 外文会议>IEEE International Conference on Fuzzy Systems >A first approach towards the usage of classifiers' performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets
【24h】

A first approach towards the usage of classifiers' performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets

机译:用于对分类器集合创建模糊措施的第一种方法:对高度不平衡数据集的案例研究

获取原文

摘要

In this work we study the possibility of learning fuzzy measures from classifiers' performance for improving the standard aggregation methods in classifier ensembles. Fuzzy measures are set-valued functions, which are not necessarily additive, and they are the basis for constructing non-linear fuzzy integrals, such as Choquet or Sugeno integral. These integrals have shown to be very useful in the aggregation of interacting criteria, since this interaction can be well modeled by a fuzzy measure. Classifier ensembles are composed of several classifiers and are aimed at improving the performance of every one of their counterparts. There are two main aspects about ensembles, first, how to build them, and second, how to combine the outputs of all their members. In this work, we focus on the second part, which is a key factor to obtain a successful ensemble. More specifically, we focus on the usage of fuzzy measures for the aggregation phase aiming at taking into account the coalitions and interactions among the members of the ensemble. Our hypothesis is that taking such information into account can lead to better performance. Moreover, we propose to directly obtain the fuzzy measure from data by considering the performance of each subset of classifiers in the ensemble. This way, one needs not include any additional learning for the fuzzy measure that can easily lead to overfitting. In order to test the usefulness of the proposed fuzzy measure, we will consider a set of 33 highly imbalanced datasets and we will develop a complete experimental study comparing the proposed combination scheme with other approaches commonly considered in the literature.
机译:在这项工作中,我们研究了从分类器的性能学习模糊措施的可能性,以改善分类器集群中的标准聚合方法。模糊措施是设定值的函数,这不一定是添加剂,它们是构造非线性模糊积分的基础,例如Chromet或Sugeno积分。这些积分在交互标准的聚合中显示出非常有用,因为这种相互作用可以通过模糊测量良好建模。分类器集合由多个分类器组成,旨在提高每个同行的性能。关于合奏有两个主要方面,首先,如何构建它们,而第二个,如何组合所有成员的输出。在这项工作中,我们专注于第二部分,这是获得成功集成的关键因素。更具体地说,我们专注于旨在考虑到集团成员之间联盟和互动的汇总阶段的模糊措施的使用。我们的假设是考虑到此类信息可能会导致更好的表现。此外,我们建议通过考虑集合体中的每个分类器的性能,直接从数据中获得模糊措施。这样,一个不需要包括任何可以容易地导致过度装备的模糊措施的额外学习。为了测试拟议的模糊措施的有用性,我们将考虑一组33个高度不平衡的数据集,我们将开发一个完整的实验研究,比较拟议的组合方案与文献中常见的其他方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号