...
首页> 外文期刊>Journal of Computer and Communications >Survey and Proposal of an Adaptive Anomaly Detection Algorithm for Periodic Data Streams
【24h】

Survey and Proposal of an Adaptive Anomaly Detection Algorithm for Periodic Data Streams

机译:周期数据流自适应异常检测算法的研究与建议

获取原文
           

摘要

Real-time anomaly detection of massive data streams is an important research topic nowadays due to the fact that a lot of data is generated in continuous temporal processes. There is a broad research area, covering mathematical, statistical, information theory methodologies for anomaly detection. It addresses various problems in a lot of domains such as health, education, finance, government, etc. In this paper, we analyze the state-of-the-art of data streams anomaly detection techniques and algorithms for anomaly detection in data streams (time series data). Critically surveying the techniques’ performances under the challenge of real-time anomaly detection of massive high-velocity streams, we conclude that the modeling of the normal behavior of the stream is a suitable approach. We evaluate Holt-Winters (HW), Taylor’s Double Holt-Winters (TDHW), Hierarchical temporal memory (HTM), Moving Average (MA), Autoregressive integrated moving average (ARIMA) forecasting models, etc. Holt-Winters (HW) and Taylor’s Double Holt-Winters (TDHW) forecasting models are used to predict the normal behavior of the periodic streams, and to detect anomalies when the deviations of observed and predicted values exceeded some predefined measures. In this work, we propose an enhancement of this approach and give a short description about the algorithms and then they are categorized by type of pre-diction as: predictive and non-predictive algorithms. We implement the Genetic Algorithm (GA) to periodically optimize HW and TDHW smoothing parameters in addition to the two sliding windows parameters that improve Hyndman’s MASE measure of deviation, and value of the threshold parameter that defines no anomaly confidence interval [1]. We also propose a new optimization function based on the input training datasets with the annotated anomaly intervals, in order to detect the right anomalies and minimize the number of false ones. The proposed method is evaluated on the known anomaly detection benchmarks NUMENTA and Yahoo datasets with annotated anomalies and real log data generated by the National education information system (NEIS)~(1) in Macedonia.
机译:由于大量数据是在连续的时间过程中生成的,因此实时检测海量数据流是当今的重要研究课题。有一个广泛的研究领域,涵盖了用于异常检测的数学,统计,信息论方法。它解决了许多领域的各种问题,例如卫生,教育,金融,政府等。在本文中,我们分析了数据流异常检测技术的最新技术和用于数据流异常检测的算法(时间序列数据)。通过对大规模高速流的实时异常检测的挑战严谨地调查了这些技术的性能,我们得出结论,对流的正常行为进行建模是一种合适的方法。我们评估Holt-Winters(HW),Taylor's Double Holt-Winters(TDHW),分层时间记忆(HTM),移动平均(MA),自回归综合移动平均(ARIMA)预测模型等。Holt-Winters(HW)和泰勒的Double Holt-Winters(TDHW)预测模型用于预测周期性流的正常行为,并在观测值和预测值的偏差超过某些预定量度时检测异常。在这项工作中,我们提出了对此方法的增强,并简要介绍了算法,然后将它们按预测类型分为:预测算法和非预测算法。我们采用遗传算法(GA)来定期优化HW和TDHW平滑参数,此外还改进了两个滑动窗口参数,这些参数改进了Hyndman的MASE偏差度量,以及阈值参数的值(该阈值未定义异常置信区间[1])。我们还基于输入的训练数据集(带有注释的异常间隔)提出了一种新的优化函数,以检测正确的异常并最大程度地减少错误的数量。该方法在已知异常检测基准NUMENTA和Yahoo数据集上进行了评估,该数据集带有带注释的异常和由马其顿国家教育信息系统(NEIS)〜(1)生成的实际日志数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号