首页> 外文会议>IEEE International Conference on Fuzzy Systems >A first approach towards the usage of classifiers' performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets

【24h】

A first approach towards the usage of classifiers' performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets

机译：用于对分类器集合创建模糊措施的第一种方法：对高度不平衡数据集的案例研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this work we study the possibility of learning fuzzy measures from classifiers' performance for improving the standard aggregation methods in classifier ensembles. Fuzzy measures are set-valued functions, which are not necessarily additive, and they are the basis for constructing non-linear fuzzy integrals, such as Choquet or Sugeno integral. These integrals have shown to be very useful in the aggregation of interacting criteria, since this interaction can be well modeled by a fuzzy measure. Classifier ensembles are composed of several classifiers and are aimed at improving the performance of every one of their counterparts. There are two main aspects about ensembles, first, how to build them, and second, how to combine the outputs of all their members. In this work, we focus on the second part, which is a key factor to obtain a successful ensemble. More specifically, we focus on the usage of fuzzy measures for the aggregation phase aiming at taking into account the coalitions and interactions among the members of the ensemble. Our hypothesis is that taking such information into account can lead to better performance. Moreover, we propose to directly obtain the fuzzy measure from data by considering the performance of each subset of classifiers in the ensemble. This way, one needs not include any additional learning for the fuzzy measure that can easily lead to overfitting. In order to test the usefulness of the proposed fuzzy measure, we will consider a set of 33 highly imbalanced datasets and we will develop a complete experimental study comparing the proposed combination scheme with other approaches commonly considered in the literature.

机译：在这项工作中，我们研究了从分类器的性能学习模糊措施的可能性，以改善分类器集群中的标准聚合方法。模糊措施是设定值的函数，这不一定是添加剂，它们是构造非线性模糊积分的基础，例如Chromet或Sugeno积分。这些积分在交互标准的聚合中显示出非常有用，因为这种相互作用可以通过模糊测量良好建模。分类器集合由多个分类器组成，旨在提高每个同行的性能。关于合奏有两个主要方面，首先，如何构建它们，而第二个，如何组合所有成员的输出。在这项工作中，我们专注于第二部分，这是获得成功集成的关键因素。更具体地说，我们专注于旨在考虑到集团成员之间联盟和互动的汇总阶段的模糊措施的使用。我们的假设是考虑到此类信息可能会导致更好的表现。此外，我们建议通过考虑集合体中的每个分类器的性能，直接从数据中获得模糊措施。这样，一个不需要包括任何可以容易地导致过度装备的模糊措施的额外学习。为了测试拟议的模糊措施的有用性，我们将考虑一组33个高度不平衡的数据集，我们将开发一个完整的实验研究，比较拟议的组合方案与文献中常见的其他方法。

著录项

来源
《IEEE International Conference on Fuzzy Systems》|2018年|596p|共8页
会议地点
作者
M. Uriz; D. Paternain; H. Bustince; M. Galar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP273.4-53;
关键词
Aggregations; Fuzzy Measures; Classification; Ensembles; Imbalanced datasets;

机译：聚合;模糊措施;分类;合奏;不平衡数据集;

相似文献

外文文献
中文文献
专利

1. Ordering-based pruning for improving the performance of ensembles of classifiers in the framework of imbalanced datasets [J] . Galar Mikel, Fernandez Alberto, Barrenechea Edurne, Information Sciences: An International Journal . 2016,第Null期

机译：基于排序的修剪在不平衡数据集框架内提高分类器集合的性能
2. Two Stage Comparison of Classifier Performances for Highly Imbalanced Datasets [J] . Goran Ore?ki, Stjepan Ore?ki Journal of Information and Organizational Sciences . 2015,第2期

机译：高度不平衡数据集分类器性能的两阶段比较
3. Feature Selection and Ensemble Learning Techniques in One-Class Classifiers: An Empirical Study of Two-Class Imbalanced Datasets [J] . Chih-Fong Tsai, Wei-Chao Lin Quality Control, Transactions . 2021,第1期

机译：单级分类器中的特征选择和集合学习技术：两级不平衡数据集的实证研究
4. A first approach towards the usage of classifiers’ performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets [C] . M. Uriz, D. Paternain, H. Bustince, IEEE International Conference on Fuzzy Systems . 2018

机译：利用分类器性能为分类器集合创建模糊度量的第一种方法：以高度不平衡的数据集为例
5. Diversified ensemble classifiers for highly imbalanced data learning and its application in bioinformatics. [D] . Ding, Zejin. 2011

机译：用于高度不平衡数据学习的多元化集成分类器及其在生物信息学中的应用。
6. iPPBS-Opt: A Sequence-Based Ensemble Classifier for Identifying Protein-Protein Binding Sites by Optimizing Imbalanced Training Datasets [O] . Jianhua Jia, Zi Liu, Xuan Xiao, 2016

机译：iPPBS-Opt：一种基于序列的集成分类器用于通过优化不平衡训练数据集来识别蛋白质与蛋白质的结合位点
7. iPPBS-Opt: A Sequence-Based Ensemble Classifier for Identifying Protein-Protein Binding Sites by Optimizing Imbalanced Training Datasets [O] . Jianhua Jia, Zi Liu, Xuan Xiao, 2016

机译：ippBs-Opt：基于序列的集成分类器，用于通过优化不平衡训练数据集来识别蛋白质 - 蛋白质结合位点

A first approach towards the usage of classifiers' performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets

摘要

著录项

相似文献

相关主题

期刊订阅