A Unifying Framework for Detecting Outliers and Change Points from Non-Stationary Time Series Data

机译：从非平稳时间序列数据中检测异常值和变更点的统一框架

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We are concerned with the issues of outlier detection and change point detection from a data stream. In the area of data mining, there have been increased interest in these issues since the former is related to fraud detection, rare event discovery, etc., while the latter is related to event/trend change detection, activity monitoring, etc. Specifically, it is important to consider the situation where the data source is non-stationary, since the nature of data source may change over time in real applications. Although in most previous work outlier detection and change point detection have not been related explicitly, this paper presents a unifying framework for dealing with both of them on the basis of the theory of on-line learning of non-stationary time series. In this framework a probabilistic model of the data source is incrementally learned using an on-line discounting learning algorithm, which can track the changing data source adap-tively by forgetting the effect of past data gradually. Then the score for any given data is calculated to measure its deviation from the learned model, with a higher score indicating a high possibility of being an outlier. Further change points in a data stream are detected by applying this scoring method into a time series of moving averaged losses for prediction using the learned model. Specifically we develop an efficient algorithms for on-line discounting learning of auto-regression models from time series data, and demonstrate the validity of our framework through simulation and experimental applications to stock market data analysis.

机译：我们关注数据流中离群值检测和变化点检测的问题。在数据挖掘领域，由于前者与欺诈检测，罕见事件发现等有关，而后者与事件/趋势变化检测，活动监视等有关，因此人们对这些问题的兴趣日益增加。重要的是要考虑数据源不稳定的情况，因为在实际应用中数据源的性质可能会随时间而变化。尽管在大多数以前的工作中，离群检测和变化点检测没有明确关联，但是本文基于非平稳时间序列的在线学习理论，提出了一个统一的框架来处理这两者。在此框架中，使用在线折扣学习算法增量学习数据源的概率模型，该算法可以通过逐渐忘记过去数据的影响来自适应地跟踪变化的数据源。然后，计算任何给定数据的分数，以衡量其与学习模型的偏差，分数越高，表示离群的可能性越大。通过将这种计分方法应用于移动平均损失的时间序列以使用学习的模型进行预测，可以检测数据流中的其他变化点。具体来说，我们开发了一种有效的算法，用于从时间序列数据进行在线折扣学习自动回归模型，并通过对股市数据分析的仿真和实验应用证明了我们框架的有效性。

著录项

来源
《Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Jul 23-26, 2002, Edmonton》|2002年|p.676-681|共6页
会议地点
作者
Kenji Yamanishi; Jun-ichi Takeuchi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. A unifying framework for detecting outliers and change points from time series [J] . Takeuchi J., Yamanishi K. IEEE Transactions on Knowledge and Data Engineering . 2006,第4期

机译：用于检测时间序列中的异常值和变化点的统一框架
2. A unified statistical framework for detecting trends in multi-timescale precipitation extremes: application to non-stationary intensity-duration-frequency curves [J] . Chagnaud Guillaume, Panthou Geremy, Vischel Theo, Theoretical and applied climatology . 2021,第1a2期

机译：用于检测多时间尺度降低极值趋势的统一统计框架：应用于非静止强度持续时间曲线的应用
3. Method for Detecting and Eliminating Data Time Series Outlier in High-Speed Process Data Sensors [J] . Fariza Tebueva, Vladimir Kopytov, Viacheslav Petrenko, International Journal on Communications Antenna and Propagation . 2017,第7期

机译：高速过程数据传感器中数据时间序列离群值的检测和消除方法
4. A unifying framework for detecting outliers and change points from non-stationary time series data [C] . Kenji Yamanishi, Jun-ichi Takeuchi Proceedings of the Eighth ACM SIGKDD international conference on knowledge discovery and data mining(KDD-2000) . 2002

机译：一个用于从非平稳时间序列数据中检测离群值和变化点的统一框架
5. The use of temporally aggregated data on detecting a structural change of a time series process. [D] . Lee, Bu Hyoung. 2016

机译：在检测时间序列过程的结构变化时使用时间汇总数据。
6. Cause-specific mortality time series analysis: a general method to detect and correct for abrupt data production changes [O] . Grégoire Rey, Albertine Aouba, Gérard Pavillon, 2011

机译：特定原因的死亡率时间序列分析：一种检测和纠正突然的数据产生变化的通用方法
7. A Unifying Framework for Detecting Outliers and Change Points from Non-Stationary Time Series Data [O] . Kenji Yamanishi, Jun-ichi Takeuchi 2002

机译：从非平稳时间序列数据中检测异常值和变更点的统一框架
8. Detecting Outliers in Energy Time-Series Data [R] . Chernick, M. R., Downing, D. J., Pike, D. H. 1981

机译：检测能量时间序列数据中的异常值

A Unifying Framework for Detecting Outliers and Change Points from Non-Stationary Time Series Data

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅