首页> 外文期刊>PeerJ Computer Science >Classification of the drifting data streams using heterogeneous diversified dynamic class-weighted ensemble
【24h】

Classification of the drifting data streams using heterogeneous diversified dynamic class-weighted ensemble

机译:使用异构多样化动态类加权集合进行漂移数据流的分类

获取原文
           

摘要

Data streams can be defined as the continuous stream of data coming from different sources and in different forms. Streams are often very dynamic, and its underlying structure usually changes over time, which may result to a phenomenon called concept drift. When solving predictive problems using the streaming data, traditional machine learning models trained on historical data may become invalid when such changes occur. Adaptive models equipped with mechanisms to reflect the changes in the data proved to be suitable to handle drifting streams. Adaptive ensemble models represent a popular group of these methods used in classification of drifting data streams. In this paper, we present the heterogeneous adaptive ensemble model for the data streams classification, which utilizes the dynamic class weighting scheme and a mechanism to maintain the diversity of the ensemble members. Our main objective was to design a model consisting of a heterogeneous group of base learners (Naive Bayes, k-NN, Decision trees), with adaptive mechanism which besides the performance of the members also takes into an account the diversity of the ensemble. The model was experimentally evaluated on both real-world and synthetic datasets. We compared the presented model with other existing adaptive ensemble methods, both from the perspective of predictive performance and computational resource requirements.
机译:数据流可以定义为来自不同源和不同形式的连续数据流。溪流通常非常动态,其潜在的结构通常会随着时间的推移而变化,这可能导致称为概念漂移的现象。在使用流数据解决预测问题时,在发生这种更改时,在历史数据上培训的传统机器学习模型可能无效。适用型号配备了机制,以反映所证明的数据的变化适合处理漂移流。 Adaptive Ensemble模型代表了一种在漂移数据流分类中使用的这些方法的流行组。在本文中,我们介绍了用于数据流分类的异构自适应集合模型,其利用动态类加权方案和一种维持集合成员的多样性的机制。我们的主要目标是设计一种模型,该模型由异构基础学习者(天真贝叶斯,K-NN,决策树)组成,具有自适应机制,除了构件的性能也将考虑到集合的多样性。该模型在实验上对现实世界和合成数据集进行了评估。我们将呈现的模型与其他现有的自适应集合方法进行比较,无论是从预测性能和计算资源要求的角度均。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号