首页> 外文会议>International conference on big data analytics >Oversample Based Large Scale Support Vector Machine for Online Class Imbalance Problem
【24h】

Oversample Based Large Scale Support Vector Machine for Online Class Imbalance Problem

机译:基于过采样的大规模支持向量机在线类不平衡问题

获取原文
获取外文期刊封面目录资料

摘要

Dealing with online class imbalance from evolving stream is a critical issue than the conventional class imbalance problem. Usually, the class imbalance problem occurs when one class of data severely outnumbers the other classes of data, thus leads to skewed class boundaries. In the case of online class imbalance problem, the degree of class imbalance changes over time and the present state of imbalance is not known a prior to the learner. To address such problem, in this paper, we present an Oversampling based Online Large Scale Support Vector Machine (OOLASVM) algorithm which is a hybrid of active sample selection and over sampling of Support Vectors and thereby both over-sampling and under sampling coexists while learning the new boundary. Further, OOLASVM maintains the balanced boundary throughout the learning process. Results on simulated and real world datasets demonstrate that proposed OOLASVM yields better performance than existing approaches such as Generalized Oversampling based Online Imbalanced Learners and Over Online Bagging.
机译:与传统的班级不平衡问题相比,从不断发展的流中处理在线班级不平衡是一个关键问题。通常,当一个类的数据严重超过其他类的数据时,就会发生类不平衡问题,从而导致类边界偏斜。在在线班级不平衡问题的情况下,班级不平衡的程度会随着时间而变化,并且学习者之前不知道当前的不平衡状态。为了解决这一问题,在本文中,我们提出了一种基于过采样的在线大规模支持向量机(OOLASVM)算法,该算法是主动样本选择和支持向量过采样的混合,从而在学习过程中过采样和欠采样共存新边界。此外,OOLASVM在整个学习过程中保持平衡的边界。在模拟和真实世界数据集上的结果表明,与现有方法(例如基于在线不平衡学习者的广义过采样和在线装袋)相比,拟议的OOLASVM产生了更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号