首页> 外文会议>Genetic programming >Handling Different Categories of Concept Drifts in Data Streams Using Distributed GP
【24h】

Handling Different Categories of Concept Drifts in Data Streams Using Distributed GP

机译:使用分布式GP处理数据流中不同类别的概念漂移

获取原文
获取原文并翻译 | 示例

摘要

Using Genetic Programming (GP) for classifying data streams is problematic as GP is slow compared with traditional single solution techniques. However, the availability of cheaper and better-performing distributed and parallel architectures make it possible to deal with complex problems previously hardly solved owing to the large amount of time necessary. This work presents a general framework based on a distributed GP ensemble algorithm for coping with different types of concept drift for the task of classification of large data streams. The framework is able to detect changes in a very efficient way using only a detection function based on the incoming unclassified data. Thus, only if a change is detected a distributed GP algorithm is performed in order to improve classification accuracy and this limits the overhead associated with the use of a population-based method. Real world data streams may present drifts of different types. The introduced detection function, based on the self-similarity fractal dimension, permits to cope in a very short time with the main types of different drifts, as demonstrated by the first experiments performed on some artificial datasets. Furthermore, having an adequate number of resources, distributed GP can handle very frequent concept drifts.
机译:使用遗传编程(GP)对数据流进行分类存在问题,因为与传统的单一解决方案技术相比,GP的速度较慢。但是,更便宜,性能更好的分布式和并行体系结构的可用性使处理以前由于大量时间而难以解决的复杂问题成为可能。这项工作提出了一个基于分布式GP集成算法的通用框架,用于应对大型数据流分类任务的不同类型的概念漂移。该框架仅使用基于传入未分类数据的检测功能就能够以非常有效的方式检测变化。因此,仅当检测到变化时,才执行分布式GP算法以提高分类准确性,并且这限制了与使用基于种群的方法相关的开销。现实世界中的数据流可能会出现不同类型的漂移。引入的基于自相似分形维数的检测功能可以在很短的时间内处理主要类型的不同漂移,如在某些人工数据集上进行的首次实验所证明的那样。此外,具有足够数量的资源,分布式GP可以处理非常频繁的概念漂移。

著录项

  • 来源
    《Genetic programming 》|2010年|p.74-85|共12页
  • 会议地点 Istanbul(TR);Istanbul(TR);Istanbul(TR);Istanbul(TR);Istanbul(TR)
  • 作者单位

    Institute for High Performance Computing and Networking, CNR-ICAR;

    Institute for High Performance Computing and Networking, CNR-ICAR;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 程序设计、软件工程 ;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号