首页> 外文会议>International Conference on Hybrid Artificial Intelligence Systems >A Cluster-Based Under-Sampling Algorithm for Class-Imbalanced Data
【24h】

A Cluster-Based Under-Sampling Algorithm for Class-Imbalanced Data

机译:基于群集的基于群的类别 - 不平衡数据的抽样算法

获取原文

摘要

The resampling methods are among the most popular strategies to face the class imbalance problem. The objective of these methods is to compensate the imbalanced class distribution by over-sampling the minority class and/or under-sampling the majority class. In this paper, a new under-sampling method based on the DBSCAN clustering algorithm is introduced. The main idea is to remove the majority class instances that are identified as noise by DBSCAN. The proposed method is empirically compared to well-known state-of-the-art under-sampling algorithms over 25 benchmarking databases and the experimental results demonstrate the effectiveness of the new method in terms of sensitivity, specificity, and geometric mean of individual accuracies.
机译:重采样方法是面对阶级不平衡问题的最受欢迎的策略之一。 这些方法的目的是通过过度抽样少数阶级和/或取样大多数类来补偿不平衡的阶级分布。 本文介绍了一种基于DBSCAN聚类算法的新的采样方法。 主要思想是删除DBSCAN被标识为噪声的多数类实例。 所提出的方法是经验与25个基准数据库的众所周知的最新的下采样算法进行比较,实验结果表明了在敏感度,特异性和各个精度的几何平均值方面的新方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号