GOOWE: Geometrically Optimum and Online-Weighted Ensemble Classifier for Evolving Data Streams

Bonab Hamed R.; Can Fazli

首页> 外文期刊>ACM transactions on knowledge discovery from data >GOOWE: Geometrically Optimum and Online-Weighted Ensemble Classifier for Evolving Data Streams

【24h】

GOOWE: Geometrically Optimum and Online-Weighted Ensemble Classifier for Evolving Data Streams

机译：GOOWE：用于不断发展的数据流的几何优化和在线加权集成分类器

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Designing adaptive classifiers for an evolving data stream is a challenging task due to the data size and its dynamically changing nature. Combining individual classifiers in an online setting, the ensemble approach, is a well-known solution. It is possible that a subset of classifiers in the ensemble outperforms others in a time-varying fashion. However, optimum weight assignment for component classifiers is a problem, which is not yet fully addressed in online evolving environments. We propose a novel data stream ensemble classifier, called Geometrically Optimum and Online-Weighted Ensemble (GOOWE), which assigns optimum weights to the component classifiers using a sliding window containing the most recent data instances. We map vote scores of individual classifiers and true class labels into a spatial environment. Based on the Euclidean distance between vote scores and ideal-points, and using the linear least squares (LSQ) solution, we present a novel, dynamic, and online weighting approach. While LSQ is used for batch mode ensemble classifiers, it is the first time that we adapt and use it for online environments by providing a spatial modeling of online ensembles. In order to show the robustness of the proposed algorithm, we use real-world datasets and synthetic data generators using the Massive Online Analysis (MOA) libraries. First, we analyze the impact of our weighting system on prediction accuracy through two scenarios. Second, we compare GOOWE with eight state-of-theart ensemble classifiers in a comprehensive experimental environment. Our experiments show that GOOWE provides improved reactions to different types of concept drift compared to our baselines. The statistical tests indicate a significant improvement in accuracy, with conservative time and memory requirements.

机译：由于数据大小及其动态变化的性质，为不断发展的数据流设计自适应分类器是一项艰巨的任务。集成方法是在在线环境中结合单个分类器，这是一种众所周知的解决方案。集成中分类器的子集可能会随时间变化而胜过其他分类器。但是，对于组件分类器的最佳权重分配是一个问题，在在线不断发展的环境中尚未完全解决。我们提出了一种新颖的数据流集成分类器，称为几何优化和在线加权集合（GOOWE），该分类器使用包含最新数据实例的滑动窗口为组件分类器分配最佳权重。我们将单个分类器的投票分数和真实的类别标签映射到空间环境中。基于投票分数和理想点之间的欧几里得距离，并使用线性最小二乘（LSQ）解决方案，我们提出了一种新颖，动态和在线的加权方法。虽然LSQ用于批处理模式集成分类器，但这是我们第一次通过提供在线合奏的空间建模来将其应用于在线环境。为了显示所提出算法的鲁棒性，我们使用了使用大规模在线分析（MOA）库的真实数据集和合成数据生成器。首先，我们通过两种情况分析加权系统对预测准确性的影响。其次，我们在综合的实验环境中将GOOWE与八个最新的集成分类器进行了比较。我们的实验表明，与我们的基线相比，GOOWE对不同类型的概念漂移提供了改进的反应。统计测试表明，准确性得到了显着提高，同时对时间和内存的要求也很保守。

著录项

来源
《ACM transactions on knowledge discovery from data》 |2018年第2期|25.1-25.33|共33页
作者
Bonab Hamed R.; Can Fazli;
展开▼
作者单位

Bilkent Univ, Ankara, Turkey;

Bilkent Univ, Bilkent Informat Retrieval Grp, Comp Engn Dept, TR-06800 Ankara, Turkey;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Ensemble classifier; concept drift; evolving data stream; dynamic weighting; geometry of voting; least squares; spatial modeling for online ensembles;

机译：集合分类器;概念漂移;不断发展的数据流;动态加权;投票的几何形状;最小二乘;在线集合的空间建模;

相似文献

外文文献
中文文献
专利

1. Online Learning Model for Handling Different Concept Drifts Using Diverse Ensemble Classifiers on Evolving Data Streams [J] . S. Ancy, D. Paulraj Cybernetics and Systems . 2019,第5a8期

机译：使用不同集成分类器在不断发展的数据流上处理不同概念漂移的在线学习模型
2. Classification of Evolving Stream Data Using Improved Ensemble Classifier [J] . Seema S, MSRIT International Journal of Data Mining & Knowledge Management Process . 2012,第4期

机译：使用改进的集成分类器对流数据进行分类
3. Active Learning From Stream Data Using Optimal Weight Classifier Ensemble [J] . Zhu X., Zhang P., Lin X., Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on . 2010,第6期

机译：使用最优权重分类器集成从流数据主动学习
4. An Experimental Comparison of Ensemble Classifiers for Evolving Data Streams [C] . Ahmad Idris Tambuwal, Daniel Neagu, Marian Gheorghe Artificial Intelligence XXXIV . 2017

机译：演化数据流集成分类器的实验比较
5. Performance Envelopes of Adaptive Ensemble Data Stream Classifiers [D] . Joe-Yen, Stefan. 2017

机译：自适应集成数据流分类器的性能包络
6. An adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional data [O] . Susmita Datta, Vasyl Pihur, Somnath Datta 2010

机译：通过装袋和等级聚合的自适应最佳集成分类器并应用于高维数据
7. A Practical Approach to Classify Evolving Data Streams: Training with Limited Amount of Labeled Data [O] . Mohammad M. Masud, Jing Gao, Latifur Khan, 2012

机译：一种对进化数据流进行分类的实用方法：使用有限数量的标记数据进行训练

GOOWE: Geometrically Optimum and Online-Weighted Ensemble Classifier for Evolving Data Streams

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅