首页> 外文会议>Biomedical and Health Informatics (BHI), 2012 IEEE-EMBS International Conference on >Imbalanced data classifier by using ensemble fuzzy c-means clustering
【24h】

Imbalanced data classifier by using ensemble fuzzy c-means clustering

机译:集成模糊c均值聚类的不平衡数据分类器

获取原文
获取原文并翻译 | 示例

摘要

Pattern classifiers developed with the imbalanced data set tend to classify an object to the class with the highest number of samples, resulting in higher overall classifier accuracy but lower sensitivity. A new approach based on a dynamic under-sampling procedure is therefore proposed to improve the classification of imbalanced datasets that are quite common in bio-medicine. To overcome a class imbalance, the dataset is resampled by using the ensemble fuzzy c-means clustering method. The under-sampling procedure is then applied to the majority class to balance the size of the classes. Compared to the existing classifiers, the proposed method yields not only higher classification accuracy and sensitivity but also more stable classification performance under different data sets, classifiers and their parameters, indicating that it is independent of particular clustering or classification methods.
机译:使用不平衡数据集开发的模式分类器倾向于将对象分类为样本数最多的类别,从而导致较高的整体分类器准确性,但灵敏度较低。因此,提出了一种基于动态欠采样过程的新方法,以改善生物医学中非常常见的不平衡数据集的分类。为了克服类不平衡问题,使用集成模糊c均值聚类方法对数据集进行重新采样。然后将欠采样过程应用于多数类,以平衡类的大小。与现有分类器相比,该方法不仅在分类,分类及其参数不同的数据集下具有更高的分类精度和灵敏度,而且分类性能更加稳定,表明该方法与特定的聚类或分类方法无关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号