首页> 外文期刊>Future generation computer systems >A non-canonical hybrid metaheuristic approach to adaptive data stream classification
【24h】

A non-canonical hybrid metaheuristic approach to adaptive data stream classification

机译:自适应数据流分类的非规范混合元启发式方法

获取原文
获取原文并翻译 | 示例
       

摘要

Data stream classification techniques have been playing an important role in big data analytics recently due to their diverse applications (e.g. fraud and intrusion detection, forecasting and healthcare monitoring systems) and the growing number of real-world data stream generators (e.g. loT devices and sensors, websites and social network feeds). Streaming data is often prone to evolution over time. In this context, the main challenge for computational models is to adapt to changes, known as concept drifts, using data mining and optimisation techniques. We present a novel ensemble technique called RED-PSO that seamlessly adapts to different concept drifts in non-stationary data stream classification tasks. RED-PSO is based on a three-layer architecture to produce classification types of different size, each created by randomly selecting a certain percentage of features from a pool of features of the target data stream. An evolutionary algorithm, namely, Replicator Dynamics (RD), is used to seamlessly adapt to different concept drifts; it allows good performing types to grow and poor performing ones to shrink in size. In addition, the selected feature combinations in all classification types are optimised using a non-canonical version of the Particle Swarm Optimisation (PSO) technique for each layer individually. PSO allows the types in each layer to go towards local (within the same type) and global (in all types) optimums with a specified velocity. A set of experiments are conducted to compare the performance of the proposed method to state-of-the-art algorithms using real-world and synthetic data streams in immediate and delayed prequential evaluation settings. The results show a favourable performance of our method in different environments. (C) 2019 Elsevier B.V. All rights reserved.
机译:数据流分类技术由于其多样化的应用(例如欺诈和入侵检测,预测和医疗保健监控系统)以及越来越多的实际数据流生成器(例如loT设备和传感器)而在大数据分析中发挥了重要作用。 ,网站和社交网络供稿)。流数据通常易于随时间演变。在这种情况下,计算模型的主要挑战是使用数据挖掘和优化技术来适应称为概念漂移的变化。我们提出了一种称为RED-PSO的新颖合奏技术,该技术可无缝适应非平稳数据流分类任务中的不同概念漂移。 RED-PSO基于三层体系结构,可以生成不同大小的分类类型,每种分类类型都是通过从目标数据流的特征池中随机选择一定百分比的特征而创建的。一种进化算法,即复制器动力学(RD),用于无缝适应不同的概念漂移。它允许表现良好的类型增长,而表现较差的类型则缩小。此外,使用非典型版本的粒子群优化(PSO)技术针对每个图层分别优化所有分类类型中的选定特征组合。 PSO允许每层中的类型以指定的速度向局部(同一类型)和全局(所有类型)最优方向发展。进行了一组实验,以在即时和延迟的优先评估设置中,使用实际和合成数据流将建议的方法的性能与最新算法进行比较。结果显示了我们的方法在不同环境下的良好性能。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号