首页> 中文期刊> 《计算机科学与探索 》 >面向不平衡数据集的改进型SMOTE算法

面向不平衡数据集的改进型SMOTE算法

             

摘要

针对SMOTE(synthetic minority over-sampling technique)在合成少数类新样本时存在的不足,提出了一种改进的SMOTE算法GA-SMOTE。该算法的关键将是遗传算法中的3个基本算子引入到SMOTE中,利用选择算子实现对少数类样本有区别的选择,使用交叉、变异算子实现对合成样本质量的控制.结合GA-SMOTE与SVM(support vector machine)算法来处理不平衡数据的分类问题.UCI数据集上的大量实验表明,GA-SMOTE在新样本的整体合成效果上表现出色,有效提高了SVM在不平衡数据集上的分类性能。%Based on analyzing the shortages of SMOTE (synthetic minority over-sampling technique) in the synthesis of minority class samples, this paper presents an improved SMOTE (GA-SMOTE). The key of GA-SMOTE lies on leading three basic genetic operators of genetic algorithm (GA) into SMOTE, making use of the selection operator to achieve the different samples from the minority class and depending on crossover operator and mutation operator to realize the fine control of the synthesis quality to the minority class samples. GA-SMOTE and SVM (support vector machine) are combined to handle the classification problem on imbalanced datasets. A large amount of experiments on the UCI datasets show that GA-SMOTE promises prominent synthesis effect to the minority class samples, and brings better classification performance on imbalanced datasets with SVM.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号