【24h】

Evolving stream classification using change detection

机译:使用变更检测对流进行分类

获取原文

摘要

Classifying instances in evolving data stream is a challenging task because of its properties, e.g., infinite length, concept drift, and concept evolution. Most of the currently available approaches to classify stream data instances divide the stream data into fixed size chunks to fit the data in memory and process the fixed size chunk one after another. However, this may lead to failure of capturing the concept drift immediately. We try to determine the chunk size dynamically by exploiting change point detection (CPD) techniques on stream data. In general, the distribution families before and after the change point are unknown over the stream, therefore non-parametric CPD algorithms are used in this case. We propose a multi-dimensional non-parametric CPD technique to determine chunk boundary over data streams dynamically which leads to better models to classify instances of evolving data streams. Experimental results show that our approach can detect the change points and classify instances of evolving data stream with high accuracy as compared to other baseline approaches.
机译:由于数据实例具有无限长,概念漂移和概念演化等特性,因此在不断发展的数据流中对实例进行分类是一项具有挑战性的任务。当前,大多数可用于对流数据实例进行分类的方法将流数据划分为固定大小的块,以将数据放入内存中,并逐个处理固定大小的块。但是,这可能导致无法立即捕获概念偏差。我们尝试通过利用流数据上的变化点检测(CPD)技术动态确定块大小。通常,更改点之前和之后的分布族在整个流中是未知的,因此在这种情况下使用非参数CPD算法。我们提出了一种多维非参数CPD技术来动态确定数据流上的块边界,这会导致更好的模型来对不断发展的数据流实例进行分类。实验结果表明,与其他基准方法相比,我们的方法可以检测变化点并以高准确度对不断变化的数据流实例进行分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号