A Boosting based Adaptive Oversampling Technique for Treatment of Class Imbalance

机译：一种基于促进基于适应性过采样技术，用于治疗类别不平衡

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The topic of class imbalance and its consequences have steered up the field of research for quite a long, as they bring pivotal impact over real-life scenarios such as medical disease diagnosis, fraud detection, etc. The typical solutions include data-level (undersampling or oversampling) or algorithmic-level (cost-sensitive learning) approaches. Synthetic Minority Oversampling Technique (SMOTE) has been acknowledged as one of the most effective data level solutions, but often suffers from the drawback of overfitting due to uniform oversampling rate. The ensemble learning techniques have recently emerged as effective; but can yield best results when integrated with data level solutions. In this work, a Boosting based oversampling technique is introduced with a customized oversampling rate, within an ensemble framework through cost-sensitive error formulation. The oversampling rate is tailored by using Local Covariance Matrix (LCM), while AdaBoost ensemble model with C4.5 weak learner is implemented as the ensemble framework. The work is compared with six benchmark techniques, for seven binary datasets. The experimental results prove the efficiency of the proposed work in treatment of imbalanced data.

机译：类别不平衡的主题及其后果已经转向了相当长的研究领域，因为它们对现实生活场景带来关键影响，如医疗疾病诊断，欺诈检测等。典型的解决方案包括数据级（欠采样或过采样）或算法级（成本敏感的学习）方法。合成少数群体过采样技术（SMOTE）被认为是最有效的数据级别解决方案之一，但由于均匀的过采样率，通常存在过度装备的缺点。集合学习技术最近有效地出现;但是在与数据级别解决方案集成时可以产生最佳结果。在这项工作中，通过成本敏感的误差制定，以定制的过采样率引入了基于升高的过采样技术。通过使用当地协方差矩阵（LCM）量身定制的过采样率，而Adaboost集合模型具有C4.5弱学习者的型号将实现为集合框架。该工作与六个基准技术进行比较，适用于七个二进制数据集。实验结果证明了所提出的工作效率治疗不平衡数据。

著录项

来源
《International Conference on Computer Communication and Informatics》|2019年|510 p. :|共7页
会议地点
作者
Debashree Devi; Saroj K. Biswas; Biswajit Purkayastha;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机网络;
关键词
Training; Covariance matrices; Cost function; Classification algorithms; Informatics; Boosting; Data models;

机译：培训;协方差矩阵;成本函数;分类算法;情报;推进;数据模型;

相似文献

外文文献
中文文献
专利

1. Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification [J] . Jinyan Li, Simon Fong, Yunsick Sung, BioData Mining . 2016,第1期

机译：生物医学数据分类中基于二元不平衡数据集的自适应群聚动态多目标合成少数过采样技术算法
2. A Boosting-Aided Adaptive Cluster-Based Undersampling Approach for Treatment of Class Imbalance Problem [J] . Devi Debashree, Namasudra Suyel, Kadry Seifedine International Journal of Data Warehousing and Mining . 2020,第3期

机译：一种基于辅助的自适应聚类的基于群体的欠采样方法，用于治疗类别不平衡问题
3. Local distribution-based adaptive minority oversampling for imbalanced data classification [J] . Wang Xinyue, Xu Jian, Zeng Tieyong, Neurocomputing . 2021,第Jana21期

机译：基于地方分布的自适应少数群体过采样，用于不平衡数据分类
4. A Boosting based Adaptive Oversampling Technique for Treatment of Class Imbalance [C] . Debashree Devi, Saroj K. Biswas, Biswajit Purkayastha International Conference on Computer Communication and Informatics . 2019

机译：基于Boosting的自适应过采样技术处理类不平衡问题
5. Cost-sensitive boosting for classification of imbalanced data. [D] . Sun, Yanmin. 2007

机译：成本敏感型提升对不平衡数据的分类。
6. Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification [O] . Jinyan Li, Simon Fong, Yunsick Sung, 2016

机译：生物医学数据分类中基于二元不平衡数据集的自适应群聚动态多目标综合少数抽样技术算法
7. Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification [O] . 2016

机译：生物医学数据分类中基于二元不平衡数据集的自适应群聚动态多目标综合少数抽样技术算法
8. Oversampling PCM Techniques and Optimum Noise Shapers for Quantizing a Class ofNonbandlimited Signals [R] . Vaidyanathan, P. P., Tuqan, J. 1996

机译：过采样pCm技术和最佳噪声整形器，用于量化一类非带限信号

A Boosting based Adaptive Oversampling Technique for Treatment of Class Imbalance

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅