首页> 外文会议>Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Jul 23-26, 2002, Edmonton >A Unifying Framework for Detecting Outliers and Change Points from Non-Stationary Time Series Data
【24h】

A Unifying Framework for Detecting Outliers and Change Points from Non-Stationary Time Series Data

机译:从非平稳时间序列数据中检测异常值和变更点的统一框架

获取原文
获取外文期刊封面目录资料

摘要

We are concerned with the issues of outlier detection and change point detection from a data stream. In the area of data mining, there have been increased interest in these issues since the former is related to fraud detection, rare event discovery, etc., while the latter is related to event/trend change detection, activity monitoring, etc. Specifically, it is important to consider the situation where the data source is non-stationary, since the nature of data source may change over time in real applications. Although in most previous work outlier detection and change point detection have not been related explicitly, this paper presents a unifying framework for dealing with both of them on the basis of the theory of on-line learning of non-stationary time series. In this framework a probabilistic model of the data source is incrementally learned using an on-line discounting learning algorithm, which can track the changing data source adap-tively by forgetting the effect of past data gradually. Then the score for any given data is calculated to measure its deviation from the learned model, with a higher score indicating a high possibility of being an outlier. Further change points in a data stream are detected by applying this scoring method into a time series of moving averaged losses for prediction using the learned model. Specifically we develop an efficient algorithms for on-line discounting learning of auto-regression models from time series data, and demonstrate the validity of our framework through simulation and experimental applications to stock market data analysis.
机译:我们关注数据流中离群值检测和变化点检测的问题。在数据挖掘领域,由于前者与欺诈检测,罕见事件发现等有关,而后者与事件/趋势变化检测,活动监视等有关,因此人们对这些问题的兴趣日益增加。重要的是要考虑数据源不稳定的情况,因为在实际应用中数据源的性质可能会随时间而变化。尽管在大多数以前的工作中,离群检测和变化点检测没有明确关联,但是本文基于非平稳时间序列的在线学习理论,提出了一个统一的框架来处理这两者。在此框架中,使用在线折扣学习算法增量学习数据源的概率模型,该算法可以通过逐渐忘记过去数据的影响来自适应地跟踪变化的数据源。然后,计算任何给定数据的分数,以衡量其与学习模型的偏差,分数越高,表示离群的可能性越大。通过将这种计分方法应用于移动平均损失的时间序列以使用学习的模型进行预测,可以检测数据流中的其他变化点。具体来说,我们开发了一种有效的算法,用于从时间序列数据进行在线折扣学习自动回归模型,并通过对股市数据分析的仿真和实验应用证明了我们框架的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号