首页> 外文会议>IEEE-EMBS International Conference on Biomedical and Health Informatics >Imbalanced Data Classifier By Using Ensemble Fuzzy C-Means Clustering
【24h】

Imbalanced Data Classifier By Using Ensemble Fuzzy C-Means Clustering

机译:使用集合模糊C-Meanse群集使用Enbalanced数据分类器

获取原文

摘要

Pattern classifiers developed with the imbalanced data set tend to classify an object to the class with the highest number of samples, resulting in higher overall classifier accuracy but lower sensitivity. A new approach based on a dynamic under-sampling procedure is therefore proposed to improve the classification of imbalanced datasets that are quite common in bio-medicine. To overcome a class imbalance, the dataset is resampled by using the ensemble fuzzy c-means clustering method. The under-sampling procedure is then applied to the majority class to balance the size of the classes. Compared to the existing classifiers, the proposed method yields not only higher classification accuracy and sensitivity but also more stable classification performance under different data sets, classifiers and their parameters, indicating that it is independent of particular clustering or classification methods.
机译:使用不平衡数据集开发的模式分类器倾向于将对象分类为具有最多数量的样本的类,导致更高的整体分类器精度,但较低的灵敏度。因此提出了一种基于动态欠抽样程序的新方法,以改善生物医学中相当常见的不平衡数据集的分类。为了克服类别不平衡,数据集通过使用集合模糊C-means群集方法来重新采样。然后将下采样过程应用于大多数类以平衡类的大小。与现有的分类器相比,所提出的方法不仅产生更高的分类精度和灵敏度,而且在不同的数据集,分类器及其参数下的分类性能也更稳定,表明它与特定的聚类或分类方法无关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号