首页> 外文会议>Asian Conference on Intelligent Information and Database Systems >Data Preprocessing for DES-KNN and Its Application to Imbalanced Medical Data Classification
【24h】

Data Preprocessing for DES-KNN and Its Application to Imbalanced Medical Data Classification

机译:DES-KNN的数据预处理及其在不平衡医学数据分类中的应用

获取原文

摘要

Learning from imbalanced data is a vital challenge for pattern classification. We often face the imbalanced data in medical decision tasks where at least one of the classes is represented by only a very small minority of the available data. We propose a novel framework for training base classifiers and preparing the dynamic selection dataset (dsel) to integrate data preprocessing and dynamic ensemble selection (des) methods for imbalanced data classification. des-knn algorithm has been chosen as the des method and its modifications base on oversampled training and validations sets using SMOTE are discussed. The proposed modifications have been evaluated based on computer experiments carried out on 15 medical datasets with various imbalance ratios. The results of experiments show that the proposed framework is very useful, especially for tasks characterized by the small imbalance ratio.
机译:从不平衡数据中学习是模式分类的关键挑战。在医疗决策任务中,我们经常会遇到数据不平衡的问题,其中至少一个类别仅由一小部分可用数据代表。我们提出了一个用于训练基础分类器并准备动态选择数据集(dsel)的新框架,以集成数据预处理和动态集成选择(des)方法以实现不平衡数据分类。 des-knn算法已被选作des方法,并讨论了基于过采样训练和使用SMOTE的验证集对其进行的修改。根据对15种医学数据集进行的计算机实验,评估了提出的修改方案,并采用了各种不平衡率。实验结果表明,所提出的框架是非常有用的,特别是对于不平衡率较小的任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号