Balanced Random Hyperboxes for Class Imbalanced Problems

Thanh Tung Khuat; My Hanh Le

首页> 外文期刊>IAENG Internaitonal journal of computer science >Balanced Random Hyperboxes for Class Imbalanced Problems

【24h】

Balanced Random Hyperboxes for Class Imbalanced Problems

机译：平衡随机超高框，用于类不平衡问题

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A Random Hyperboxes (RH) classifier is a simple but powerful randomization-based ensemble model, including hyperbox-based classifiers used as base learners. Individual learners in this ensemble model are trained on random sub-spaces of both instance and feature spaces. This facet results in a flexible mechanism to form a high-performing classifier competitive with other ensemble models in the literature. Like other machine learning models, however, the RH classifier also faces inefficiency when dealing with class-imbalanced datasets. Meanwhile, data containing highly imbalanced class distributions are prevalent in practical applications. Hence, this paper proposes a new variant of the original RH model, namely Balance Random Hyperboxes (BRH), to bypass this drawback effectively. The proposed method uses an under-sampling strategy to build individual learners instead of the random sampling method employed in the original RH model. The experiment conducted on software fault datasets, which show a highly class-imbalanced property, indicated the proposed method's efficiency compared to the original RH model and other ensemble models.

机译：随机超高函数（RH）分类器是一个简单但强大的随机化的集合模型，包括基于超键的分类器作为基础学习者。该集合模型中的个别学习者在实例和特征空间的随机子空间上培训。这方面导致灵活的机制，以形成具有文献中的其他集合模型的高性能分类器。然而，与其他机器学习模型一样，RH分类器也在处理类别 - 不平衡数据集时面临效率。同时，在实际应用中，包含高度不平衡的类分布的数据在普遍存在。因此，本文提出了原始RH模型的新变种，即平衡随机超高箱（BRH），有效地绕过该缺点。该方法使用欠采样策略来构建单个学习者而不是原始RH模型中采用的随机采样方法。在软件故障数据集上进行的实验，该数据集显示出高度类别的属性，表明了与原始RH模型和其他集合模型相比的方法的效率。

著录项

来源
《IAENG Internaitonal journal of computer science》 |2021年第2期|406-412|共7页
作者
Thanh Tung Khuat; My Hanh Le;
展开▼
作者单位

Advanced Analytics Institute Faculty of Engineering and IT University of Technology Sydney NSW 2007 Australia;

IT Faculty University of Science and Technology The University of Danang Danang 550000 Vietnam;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Balanced random hyperboxes; ensemble learning; randomization-based learning; class-imbalanced data; software fault prediction;

机译：平衡的随机超高箱;合奏学习;基于随机的学习;类别 - 不平衡数据;软件故障预测;

相似文献

外文文献
中文文献
专利

1. Study of Hellinger Distance as a splitting metric for Random Forests in balanced and imbalanced classification datasets [J] . Aler Ricardo, Valls Jose M., Bostrom Henrik Expert systems with applications . 2020,第Jula期

机译：平衡和不平衡分类数据集随机林分裂度量的地狱距离研究
2. Cost-Sensitive Support Vector Machine Using Randomized Dual Coordinate Descent Method for Big Class-Imbalanced Data Classification [J] . MingzhuTang, ChunhuaYang, KangZhang, Abstract and applied analysis . 2014,第6期

机译：成本敏感的支持向量机，采用随机双坐标下降法进行大类别不平衡数据分类
3. Cost-Sensitive Support Vector Machine Using Randomized Dual Coordinate Descent Method for Big Class-Imbalanced Data Classification [J] . MingzhuTang, ChunhuaYang, KangZhang, Abstract and applied analysis . 2014,第3期

机译：成本敏感的支持向量机，采用随机双坐标下降法进行大类别不平衡数据分类
4. Class-Wise Difficulty-Balanced Loss for Solving Class-Imbalance [C] . Saptarshi Sinha, Hiroki Ohashi, Katsuyuki Nakamura Asian conference on computer vision . 2020

机译：Class-Wise难度 - 解决类别不平衡的均衡损失
5. A balanced approach to the multi-class imbalance problem. [D] . Mosley, Lawrence Se'kou Denu. 2013

机译：解决多类不平衡问题的一种平衡方法。
6. A Balanced Accuracy Fitness Function Leads to Robust Analysis using Grammatical Evolution Neural Networks in the Case of Class Imbalance [O] . Nicholas E. Hardison, Theresa J. Fanelli, Scott M. Dudek, -1

机译：在班级不平衡的情况下平衡精度的适应度函数导致使用语法进化神经网络进行稳健分析
7. Classification of imbalanced marketing data with balanced random sets [O] . Nikulin Vladimir, McLachlan Geoffrey J. 2009

机译：具有平衡随机集的不平衡营销数据的分类

Balanced Random Hyperboxes for Class Imbalanced Problems

摘要

著录项

相似文献

相关主题

期刊订阅