首页> 外文会议>European conference on machine learning and principles and practice of knowledge discovery in databases >Classification of High-Dimensional Evolving Data Streams via a Resource-Efficient Online Ensemble
【24h】

Classification of High-Dimensional Evolving Data Streams via a Resource-Efficient Online Ensemble

机译:通过资源高效的在线集合对高维演进数据流进行分类

获取原文

摘要

A novel online ensemble strategy, ensemble BPegasos(EBPegasos), is proposed to solve the problems simultaneously caused by concept drifting and the curse of dimensionality in classifying high-dimensional evolving data streams, which has not been addressed in the literature. First, EBPegasos uses BPegasos, an online kernelized SVM-based algorithm, as the component classifier to address the scalability and sparsity of high-dimensional data. Second, EBPegasos takes full advantage of the characteristics of BPegasos to cope with various types of concept drifts. Specifically, EBPegasos constructs diverse component classifiers by controlling the budget size of BPegasos; it also equips each component with a drift detector to monitor and evaluate its performance, and modifies the ensemble structure only when large performance degradation occurs. Such conditional structural modification strategy makes EBPegasos strike a good balance between exploiting and forgetting old knowledge. Lastly, we first prove experimentally that EBPegasos is more effective and resource-efficient than the tree ensembles on high-dimensional data. Then comprehensive experiments on synthetic and real-life datasets also show that EBPegasos can cope with various types of concept drifts significantly better than the state-of-the-art ensemble frameworks when all ensembles use BPegasos as the base learner.
机译:提出了一种新颖的在线集成策略BPegasos(EBPegasos),以解决概念漂移和维数诅咒同时对高维演化数据流进行分类的问题,这在文献中没有得到解决。首先,EBPegasos使用BPegasos(一种基于SVM的在线内核化算法)作为组件分类器,以解决高维数据的可伸缩性和稀疏性。其次,EBPegasos充分利用了BPegasos的特性来应对各种类型的概念漂移。具体来说,EBPegasos通过控制BPegasos的预算大小来构造各种组件分类器。它还为每个组件配备了一个漂移检测器,以监视和评估其性能,并且仅在出现较大的性能下降时才修改整体结构。这种有条件的结构修改策略使EBPegasos在开发和遗忘旧知识之间取得了良好的平衡。最后,我们首先通过实验证明EBPegasos比在高维数据上集成的树更有效,资源效率更高。然后,在合成和现实数据集上进行的综合实验还表明,当所有乐团都使用BPegasos作为基础学习者时,EBBPegasos可以比各种最新的集成框架更好地应对各种类型的概念漂移。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号