首页> 外文期刊>ACM transactions on knowledge discovery from data >GOOWE: Geometrically Optimum and Online-Weighted Ensemble Classifier for Evolving Data Streams
【24h】

GOOWE: Geometrically Optimum and Online-Weighted Ensemble Classifier for Evolving Data Streams

机译:GOOWE:用于不断发展的数据流的几何优化和在线加权集成分类器

获取原文
获取原文并翻译 | 示例
       

摘要

Designing adaptive classifiers for an evolving data stream is a challenging task due to the data size and its dynamically changing nature. Combining individual classifiers in an online setting, the ensemble approach, is a well-known solution. It is possible that a subset of classifiers in the ensemble outperforms others in a time-varying fashion. However, optimum weight assignment for component classifiers is a problem, which is not yet fully addressed in online evolving environments. We propose a novel data stream ensemble classifier, called Geometrically Optimum and Online-Weighted Ensemble (GOOWE), which assigns optimum weights to the component classifiers using a sliding window containing the most recent data instances. We map vote scores of individual classifiers and true class labels into a spatial environment. Based on the Euclidean distance between vote scores and ideal-points, and using the linear least squares (LSQ) solution, we present a novel, dynamic, and online weighting approach. While LSQ is used for batch mode ensemble classifiers, it is the first time that we adapt and use it for online environments by providing a spatial modeling of online ensembles. In order to show the robustness of the proposed algorithm, we use real-world datasets and synthetic data generators using the Massive Online Analysis (MOA) libraries. First, we analyze the impact of our weighting system on prediction accuracy through two scenarios. Second, we compare GOOWE with eight state-of-theart ensemble classifiers in a comprehensive experimental environment. Our experiments show that GOOWE provides improved reactions to different types of concept drift compared to our baselines. The statistical tests indicate a significant improvement in accuracy, with conservative time and memory requirements.
机译:由于数据大小及其动态变化的性质,为不断发展的数据流设计自适应分类器是一项艰巨的任务。集成方法是在在线环境中结合单个分类器,这是一种众所周知的解决方案。集成中分类器的子集可能会随时间变化而胜过其他分类器。但是,对于组件分类器的最佳权重分配是一个问题,在在线不断发展的环境中尚未完全解决。我们提出了一种新颖的数据流集成分类器,称为几何优化和在线加权集合(GOOWE),该分类器使用包含最新数据实例的滑动窗口为组件分类器分配最佳权重。我们将单个分类器的投票分数和真实的类别标签映射到空间环境中。基于投票分数和理想点之间的欧几里得距离,并使用线性最小二乘(LSQ)解决方案,我们提出了一种新颖,动态和在线的加权方法。虽然LSQ用于批处理模式集成分类器,但这是我们第一次通过提供在线合奏的空间建模来将其应用于在线环境。为了显示所提出算法的鲁棒性,我们使用了使用大规模在线分析(MOA)库的真实数据集和合成数据生成器。首先,我们通过两种情况分析加权系统对预测准确性的影响。其次,我们在综合的实验环境中将GOOWE与八个最新的集成分类器进行了比较。我们的实验表明,与我们的基线相比,GOOWE对不同类型的概念漂移提供了改进的反应。统计测试表明,准确性得到了显着提高,同时对时间和内存的要求也很保守。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号