首页> 外文期刊>International Journal of Data Science and Analytics >Unsupervised online detection and prediction of outliers in streams of sensor data
【24h】

Unsupervised online detection and prediction of outliers in streams of sensor data

机译:在无监督的情况下在线检测和预测传感器数据流中的异常值

获取原文
获取原文并翻译 | 示例
           

摘要

Outliers are unexpected observations, which deviate from the majority of observations. Outlier detection and prediction are challenging tasks, because outliers are rare by definition. A stream is an unbounded source of data, which has to be processed promptly. This article proposes novel methods for outlier detection and outlier prediction in streams of sensor data. The outlier detection is an independent, unsupervised process, which is implemented using an autoencoder. The outlier detection continuously evaluates if the latest data point x_i from a stream is an inlier or an outlier. This distinction is based on the reconstruction cost accompanied with Chebyshev's inequality and the EWMA (exponentially weighted moving average) model. The outlier prediction uses the results of the outlier detection to form the required training data. The outlier prediction utilizes LR (logistic regression), SGD (stochastic gradient descent) and the hidden representation provided by the autoencoder to predict outliers in streams. The results of the experiments show that the proposed methods (1) provide accurate results, (2) are calculated in reduced computation time and (3) use a low amount of memory. Our proposed methods are suitable for analyzing streams of sensor data and providing results with low latency. The experiments also indicated that the outlier prediction is able to anticipate the occurrence of outliers in streams of sensor data.
机译:离群值是意料之外的观察结果,与大多数观察结果不同。离群值的检测和预测是具有挑战性的任务,因为离群值在定义上很少见。流是无限的数据源,必须立即进行处理。本文提出了用于传感器数据流中异常值检测和异常值预测的新方法。离群值检测是一个独立的,无监督的过程,可使用自动编码器实现。离群值检测连续评估来自流的最新数据点x_i是离群值还是离群值。这种区别是基于重建成本,切比雪夫(Chebyshev)不平等和EWMA(指数加权移动平均线)模型。离群值预测使用离群值检测的结果来形成所需的训练数据。离群值预测利用LR(逻辑回归),SGD(随机梯度下降)和自动编码器提供的隐藏表示来预测流中的离群值。实验结果表明,所提出的方法(1)提供了准确的结果,(2)减少了计算时间,并且(3)使用了少量的内存。我们提出的方法适用于分析传感器数据流并提供低延迟的结果。实验还表明,离群值预测能够预测传感器数据流中离群值的发生。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号