首页> 外文会议>International Joint Conference on Computer Science and Software Engineering >DBSM: The combination of DBSCAN and SMOTE for imbalanced data classification
【24h】

DBSM: The combination of DBSCAN and SMOTE for imbalanced data classification

机译:DBSM:DBSCAN和SMOTE的结合用于不平衡的数据分类

获取原文

摘要

Many applications in the real world encounter the class imbalance problem. This problem affects the performance of the model prediction. Nowadays, resampling technique is a popular technique to handle the class imbalance problem such as oversampling, undersampling, and hybridsampling. Thus, this paper proposes a new hybrid resampling technique to deal with the class imbalance problem, called DBSM. The concept of DBSM is to use DBSCAN algorithm for undersampling and apply SMOTE technique for oversampling. The experimental results of the DBSM algorithm are compared with an original datasets and other sampling techniques, which are SMOTE, Tomek Links, SMOTE+Tomek Links and DBSCAN. The results show that the DBSM can improve the predictive performance of the classifiers. In addition, it yields the best in the average of AUC, F-measure, and accuracy.
机译:现实世界中的许多应用程序都遇到类不平衡问题。此问题影响模型预测的性能。如今,重采样技术是一种用于处理类不平衡问题的流行技术,例如过采样,欠采样和混合采样。因此,本文提出了一种新的混合重采样技术来解决类不平衡问题,称为DBSM。 DBSM的概念是使用DBSCAN算法进行欠采样,并使用SMOTE技术进行过采样。将DBSM算法的实验结果与原始数据集和其他采样技术(SMOTE,Tomek链接,SMOTE + Tomek链接和DBSCAN)进行了比较。结果表明,DBSM可以提高分类器的预测性能。此外,它在AUC,F量度和准确性的平均值方面均达到最佳。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号