A novel ensemble method for classifying imbalanced data

Sun Zhongbin; Song Qinbao; Zhu Xiaoyan; Sun Heli; Xu Baowen; Zhou Yuming

Abstract

Class imbalance problems have been reported to severely hinder the classification performance of many standard learning algorithms and have attracted a great deal of attention from researchers in different fields. Consequently, a number of methods, such as sampling methods, cost-sensitive learning methods, and Bagging- and Boosting-based ensemble methods, have been proposed to solve these problems. However, because these conventional class imbalance handling methods may alter the original data distribution, they might suffer from the loss of potentially useful information, unexpected mistakes, or an increased likelihood of overfitting. We therefore propose a novel ensemble method, which first converts an imbalanced data set into multiple balanced ones and then builds a number of classifiers on these balanced data sets with a specific classification algorithm. Finally, the classification results of these classifiers for new data are combined by a specific ensemble rule. In the empirical study, our method was compared with different class imbalance handling methods: three conventional sampling methods, one cost-sensitive learning method, six Bagging- and Boosting-based ensemble methods, our previous method EM1vs1, and two fuzzy-rule-based classification methods. The experimental results on 46 imbalanced data sets show that our proposed method is usually superior to the conventional imbalance data handling methods when solving highly imbalanced problems. (C) 2014 Elsevier Ltd. All rights reserved.
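
The three steps named in the abstract (convert the imbalanced set into several balanced ones, train classifiers on them, combine their outputs) can be illustrated with a minimal sketch. The abstract does not say how the data are balanced, which classification algorithm is used, or what the ensemble rule is, so everything concrete here is an assumption made for illustration: the majority class is randomly partitioned into bins of roughly minority-class size, one nearest-centroid classifier is trained per bin, and predictions are combined by majority vote with ties going to the minority class. All function names and the toy data are hypothetical.

```python
import random


def split_balanced(majority, minority, seed=0):
    """Randomly partition the majority class into bins of roughly minority
    size and pair each bin with all minority samples (assumed strategy)."""
    rng = random.Random(seed)
    shuffled = list(majority)
    rng.shuffle(shuffled)
    n_bins = max(1, round(len(shuffled) / len(minority)))
    # Striding yields bins whose sizes differ by at most one sample.
    return [(shuffled[i::n_bins], minority) for i in range(n_bins)]


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length points."""
    dim = len(points[0])
    return [sum(p[d] for p in points) / len(points) for d in range(dim)]


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def train_nearest_centroid(neg, pos):
    """Assumed base learner: predict 1 (minority) if x is closer to the
    minority centroid than to the majority-bin centroid, else 0."""
    c_neg, c_pos = centroid(neg), centroid(pos)

    def predict(x):
        return 1 if sq_dist(x, c_pos) < sq_dist(x, c_neg) else 0

    return predict


def fit_ensemble(majority, minority, seed=0):
    """Train one base classifier per balanced subset."""
    return [train_nearest_centroid(neg, pos)
            for neg, pos in split_balanced(majority, minority, seed)]


def predict_ensemble(models, x):
    """Assumed ensemble rule: majority vote, ties go to the minority class."""
    positive_votes = sum(m(x) for m in models)
    return 1 if 2 * positive_votes >= len(models) else 0


# Toy data: 12 majority points near the origin, 3 minority points near (5, 5).
majority = [(i % 4 * 0.5, i // 4 * 0.5) for i in range(12)]
minority = [(5.0, 5.0), (5.5, 4.5), (4.5, 5.5)]

models = fit_ensemble(majority, minority)
print(len(models), "classifiers trained")
print(predict_ensemble(models, (4.8, 5.1)), predict_ensemble(models, (0.2, 0.3)))
```

Because every balanced subset reuses all minority samples and the bins together cover all majority samples, this sketch neither discards majority data nor synthesizes new minority data, which is the motivation the abstract gives for avoiding conventional sampling.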

Bibliographic details

Source
Pattern Recognition: The Journal of the Pattern Recognition Society | 2015, Issue 5 | 15 pages
Authors
Sun Zhongbin; Song Qinbao; Zhu Xiaoyan; Sun Heli; Xu Baowen; Zhou Yuming
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology
Keywords
Imbalanced data; Classification; Ensemble learning

Similar literature

1. A weighted hybrid ensemble method for classifying imbalanced data [J]. Zhao Jiakun, Jin Ju, Chen Si. Knowledge-Based Systems. 2020, Sep. 5 issue.
2. A new sampling method for classifying imbalanced data based on support vector machine ensemble [J]. Jian Chuanxia, Gao Jian, Ao Yinhui. Neurocomputing. 2016, Jun. 12 issue.
3. A novel ensemble method for classifying imbalanced data [J]. Sun Zhongbin, Song Qinbao, Zhu Xiaoyan. Pattern Recognition: The Journal of the Pattern Recognition Society. 2015, Issue 5.
4. A first approach towards the usage of classifiers’ performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets [C]. M. Uriz, D. Paternain, H. Bustince. IEEE International Conference on Fuzzy Systems. 2018.
5. Diversified ensemble classifiers for highly imbalanced data learning and its application in bioinformatics [D]. Ding, Zejin. 2011.
6. iPPBS-Opt: A Sequence-Based Ensemble Classifier for Identifying Protein-Protein Binding Sites by Optimizing Imbalanced Training Datasets [O]. Jianhua Jia, Zi Liu, Xuan Xiao. 2016.
7. Concept Drift Detection and Adaption in Big Imbalance Industrial IoT Data Using an Ensemble Learning Method of Offline Classifiers [O]. Chun-Cheng Lin, Der-Jiunn Deng, Chin-Hung Kuo. 2019.
