首页> 外文会议>IEEE Wireless Communications and Networking Conference >Outlier detection in sensed data using statistical learning models for IoT
【24h】

Outlier detection in sensed data using statistical learning models for IoT

机译:使用用于物联网的统计学习模型检测数据中的异常值

获取原文
获取外文期刊封面目录资料

摘要

Internet of Things (IoT) devices are composed of millions of sensors that continuously sense environmental parameters which are effectively fused to gain insights on a particular area or region based on which desired actions are taken. However, before any fusion of the data takes place, checking the quality of data is of paramount importance as these data may also be contaminated with outliers which limit their overall efficiency. Outliers or in other words, any abrupt change in values can be either due to a drastic change in environment (Event) or because of any malfunctioning of the sensor (Error). This paper introduces an IoT architecture to detect the occurrence of both Error and Event in a forest environment with the help of four statistical models, i.e., Classification and Regression Trees (CART), Random Forest (RF), Gradient Boosting Machine (GBM) and Linear Discriminant Analysis (LDA). We take into account the spatial and temporal dependencies of the data as well. Simulation results show that the models can effectively detect both types of outliers with upto 100% accuracy for Error detection and upto 98.51% in case of Event detection. Also, the results provide evidence that using spatial and temporal parameters in event detection can have a positive influence on the prediction accuracy which was observed to increase from 81.68% to 96.53% using RF. Among the four models, RF is seen to outperform the others.
机译:物联网(IoT)设备由数百万个传感器组成,这些传感器不断感应环境参数,这些参数有效地融合在一起,以根据需要采取的行动获得对特定区域或区域的洞察力。但是,在进行任何数据融合之前,检查数据质量至关重要,因为这些数据也可能被异常值所污染,从而限制了它们的整体效率。离群值或换句话说,任何突然的值变化可能是由于环境的急剧变化(事件),也可能是由于传感器的任何故障(错误)。本文介绍了一种物联网架构,借助四种统计模型来检测森林环境中错误和事件的发生,即分类和回归树(CART),随机森林(RF),梯度提升机(GBM)和线性判别分析(LDA)。我们还考虑了数据的时空依赖性。仿真结果表明,该模型可以有效地检测出两种类型的异常值,用于错误检测的精度高达100%,而在进行事件检测的情况下,精度高达98.51%。同样,结果提供了证据,即在事件检测中使用空间和时间参数可以对预测准确性产生积极影响,使用RF可以观察到预测准确性从81.68%提高到96.53%。在这四个模型中,RF被认为优于其他模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号