首页> 外文学位 >Multiclass Imbalanced Learning in Ensembles through Selective Sampling.
【24h】

Multiclass Imbalanced Learning in Ensembles through Selective Sampling.

机译:通过选择性采样在集合中进行多类不平衡学习。

获取原文
获取原文并翻译 | 示例

摘要

Imbalanced learning is the problem of learning from datasets when the class proportions are highly imbalanced. Imbalanced datasets are increasingly seen in many domains and pose a challenge to traditional classification techniques. Learning from imbalanced multiclass data (three or more classes) creates additional complexities. Studies suggest that ensemble learners can be trained to emphasize different segments of data pertaining to different classes and thereby produce more accurate results than regular imbalance learning techniques. Thus, we propose a new approach to building ensembles of classifiers for multiclass imbalanced datasets, called Multiclass Imbalance Learning in Ensembles through Selective Sampling (MILES). Each member of MILES is trained with the data selectively sampled from the bands around cluster centroids in a way that diversity is aggressively encouraged within the ensemble. Resampling techniques are utilized to balance the distribution of the data that comes from each cluster. We performed several experiments applying our approach to different real-word datasets demonstrating improved performance for recognizing minority class examples and balancing the G-mean and Mean Area Under the Curve (MAUC) across classes.;We further applied MILES to classify prolonged emergency department (ED) stays with consistently higher performance as compared to existing ensemble methods.
机译:学习失衡是班级比例高度失衡时从数据集中学习的问题。不平衡的数据集在许多领域中越来越多地出现,并且对传统分类技术提出了挑战。从不平衡的多类数据(三个或更多类)中学习会带来额外的复杂性。研究表明,可以训练整体学习者强调与不同类别有关的数据的不同部分,从而比常规的不平衡学习技术产生更准确的结果。因此,我们提出了一种为多类不平衡数据集构建分类器集合的新方法,称为“通过选择性采样的集合中的多类不平衡学习”。 MILES的每个成员都接受了从簇质心周围频带中选择性采样的数据的训练,从而在集合体中积极鼓励多样性。重采样技术用于平衡来自每个群集的数据的分布。我们对不同的实词数据集进行了多次实验,证明了在识别少数族裔类别示例以及平衡跨类别的G均值和曲线下均值面积(MAUC)方面的改进性能。与现有的集成方法相比,ED)始终保持较高的性能。

著录项

  • 作者

    Azari, Ali.;

  • 作者单位

    University of Maryland, Baltimore County.;

  • 授予单位 University of Maryland, Baltimore County.;
  • 学科 Information science.;Computer science.
  • 学位 Ph.D.
  • 年度 2016
  • 页码 133 p.
  • 总页数 133
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:47:17

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号