首页> 外文期刊>International journal of data mining, modelling and management >Online multi-dimensional regression analysis on concept-drifting data streams
【24h】

Online multi-dimensional regression analysis on concept-drifting data streams

机译:概念漂移数据流的在线多维回归分析

获取原文
获取原文并翻译 | 示例
           

摘要

Stream data is generated continuously in a dynamic environment, with huge volume and fast changing behaviour. In order to perform regression on data streams, it is required to incrementally reconstruct the regression model as new stream data flows in. However, due to the tremendous data volume, it is impossible to scan the entire data stream multiple times to re-compute the regression model parameters. Therefore, one-scan algorithms are required for such streaming applications. In this paper, we investigate online multi-dimensional regression analysis of concept-drifting data streams, and present two algorithms, approximate stream regression (ASR) and ensemble stream regression (ESR). ASR approach dynamically re-computes the regression function parameters, considering not only the data records of the current window, but also a synopsis of the previous data. ESR approach trains an ensemble of regression models from sequential chunks of the data stream, and then computes the weighted average of their predictions. Experiments show that the proposed methods are not only efficient in time and space but also able to generate better fitted regression functions compared to the existing stream regression algorithms such as sliding window regression and incremental stream regression.
机译:流数据是在动态环境中连续生成的,具有巨大的数量和快速变化的行为。为了对数据流执行回归,需要随着新的流数据流入逐步地重建回归模型。但是,由于庞大的数据量,不可能多次扫描整个数据流以重新计算回归模型参数。因此,这种流应用程序需要单扫描算法。在本文中,我们研究了概念漂移数据流的在线多维回归分析,并提出了两种算法,近似流回归(ASR)和集成流回归(ESR)。 ASR方法不仅考虑当前窗口的数据记录,而且考虑先前数据的概要,从而动态地重新计算回归函数参数。 ESR方法从数据流的连续块中训练出一组回归模型,然后计算其预测的加权平均值。实验表明,与现有的流回归算法(如滑动窗口回归和增量流回归)相比,所提出的方法不仅在时间和空间上都是有效的,而且能够生成更好的拟合回归函数。

著录项

  • 来源
  • 作者单位

    Department of Computer and Information Science, Indiana University - Purdue University Indianapolis, 723 West Michigan Street, Indianapolis, IN 46202-5132, USA;

    Department of Computer and Information Science, Indiana University - Purdue University Indianapolis, 723 West Michigan Street, Indianapolis, IN 46202-5132, USA;

    Department of Electrical and Computer Engineering, Indiana University - Purdue University Indianapolis, 723 West Michigan Street, Indianapolis, IN 46202-5132, USA;

    Department of Electrical and Computer Engineering, Indiana University - Purdue University Indianapolis, 723 West Michigan Street, Indianapolis, IN 46202-5132, USA;

    Department of Electrical and Computer Engineering, Indiana University - Purdue University Indianapolis, 723 West Michigan Street, Indianapolis, IN 46202-5132, USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    regression; data streams; ensemble; concept-drift; data modelling;

    机译:回归数据流;合奏;概念漂移数据建模;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号