首页> 外文期刊>World Wide Web >Multi-window based ensemble learning for classification of imbalanced streaming data
【24h】

Multi-window based ensemble learning for classification of imbalanced streaming data

机译:基于多窗口的集成学习用于不平衡流数据的分类

获取原文
获取原文并翻译 | 示例
           

摘要

Imbalanced streaming data is commonly encountered in real-world data mining and machine learning applications, and has attracted much attention in recent years. Both imbalanced data and streaming data in practice are normally encountered together; however, little research work has been studied on the two types of data together. In this paper, we propose a multi-window based ensemble learning method for the classification of imbalanced streaming data. Three types of windows are defined to store the current batch of instances, the latest minority instances, and the ensemble classifier. The ensemble classifier consists of a set of latest sub-classifiers, and the instances employed to train each sub-classifier. All sub-classifiers are weighted prior to predicting the class labels of newly arriving instances, and new sub-classifiers are trained only when the precision is below a predefined threshold. Extensive experiments on synthetic datasets and real-world datasets demonstrate that the new approach can efficiently and effectively classify imbalanced streaming data, and generally outperforms existing approaches.
机译:流数据不平衡在现实世界的数据挖掘和机器学习应用中经常遇到,并且近年来引起了很多关注。实际上,通常会同时遇到不平衡数据和流数据。但是,关于这两种数据的研究很少。在本文中,我们提出了一种基于多窗口的集成学习方法,用于不平衡流数据的分类。定义了三种类型的窗口来存储当前的实例批次,最新的少数实例和整体分类器。集成分类器包括一组最新的子分类器,以及用于训练每个子分类器的实例。在预测新到达实例的类别标签之前,对所有子分类器进行加权,并且仅在精度低于预定义阈值时才训练新的子分类器。在合成数据集和现实数据集上进行的大量实验表明,该新方法可以有效地对不平衡的流数据进行分类,并且通常优于现有方法。

著录项

  • 来源
    《World Wide Web》 |2017年第6期|1507-1525|共19页
  • 作者

    Li Hu; Wang Ye; Wang Hua; Zhou Bin;

  • 作者单位

    Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China;

    Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China|Victoria Univ, Ctr Appl Informat, Melbourne, Vic, Australia;

    Victoria Univ, Ctr Appl Informat, Melbourne, Vic, Australia;

    Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China|Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Hunan, Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Streaming data; Class imbalance; Multi-window; Ensemble learning;

    机译:流数据;班级失衡;多窗口;整合学习;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号