首页> 外文期刊>IAENG Internaitonal journal of computer science >Balanced Random Hyperboxes for Class Imbalanced Problems
【24h】

Balanced Random Hyperboxes for Class Imbalanced Problems

机译:平衡随机超高框,用于类不平衡问题

获取原文
获取原文并翻译 | 示例
           

摘要

A Random Hyperboxes (RH) classifier is a simple but powerful randomization-based ensemble model, including hyperbox-based classifiers used as base learners. Individual learners in this ensemble model are trained on random sub-spaces of both instance and feature spaces. This facet results in a flexible mechanism to form a high-performing classifier competitive with other ensemble models in the literature. Like other machine learning models, however, the RH classifier also faces inefficiency when dealing with class-imbalanced datasets. Meanwhile, data containing highly imbalanced class distributions are prevalent in practical applications. Hence, this paper proposes a new variant of the original RH model, namely Balance Random Hyperboxes (BRH), to bypass this drawback effectively. The proposed method uses an under-sampling strategy to build individual learners instead of the random sampling method employed in the original RH model. The experiment conducted on software fault datasets, which show a highly class-imbalanced property, indicated the proposed method's efficiency compared to the original RH model and other ensemble models.
机译:随机超高函数(RH)分类器是一个简单但强大的随机化的集合模型,包括基于超键的分类器作为基础学习者。该集合模型中的个别学习者在实例和特征空间的随机子空间上培训。这方面导致灵活的机制,以形成具有文献中的其他集合模型的高性能分类器。然而,与其他机器学习模型一样,RH分类器也在处理类别 - 不平衡数据集时面临效率。同时,在实际应用中,包含高度不平衡的类分布的数据在普遍存在。因此,本文提出了原始RH模型的新变种,即平衡随机超高箱(BRH),有效地绕过该缺点。该方法使用欠采样策略来构建单个学习者而不是原始RH模型中采用的随机采样方法。在软件故障数据集上进行的实验,该数据集显示出高度类别的属性,表明了与原始RH模型和其他集合模型相比的方法的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号