...
首页> 外文期刊>The Science of the Total Environment >A framework for automated anomaly detection in high frequency water-quality data from in situ sensors
【24h】

A framework for automated anomaly detection in high frequency water-quality data from in situ sensors

机译:自动检测来自原位传感器的高频水质数据的框架

获取原文
获取原文并翻译 | 示例
           

摘要

Monitoring the water quality of rivers is increasingly conducted using automated in situ sensors, enabling timelier identification of unexpected values or trends. However, the data are confounded by anomalies caused by technical issues, for which the volume and velocity of data preclude manual detection. We present a framework for automated anomaly detection in high-frequency water-quality data from in situ sensors, using turbidity, conductivity and river level data collected from rivers flowing into the Great Barrier Reef. After identifying end-user needs and defining anomalies, we ranked anomaly importance and selected suitable detection methods. High priority anomalies included sudden isolated spikes and level shifts, most of which were classified correctly by regression-based methods such as autoregressive integrated moving average models. However, incorporation of multiple water-quality variables as covariates reduced performance due to complex relationships among variables. Classifications of drift and periods of anomalously low or high variability were more often correct when we applied mitigation, which replaces anomalous measurements with forecasts for further forecasting, but this inflated false positive rates. Feature-based methods also performed well on high priority anomalies and were similarly less proficient at detecting lower priority anomalies, resulting in high false negative rates. Unlike regression-based methods, however, all feature-based methods produced low false positive rates and have the benefit of not requiring training or optimization. Rule-based methods successfully detected a subset of lower priority anomalies, specifically impossible values and missing observations. We therefore suggest that a combination of methods will provide optimal performance in terms of correct anomaly detection, whilst minimizing false detection rates. Furthermore, our framework emphasizes the importance of communication between end-users and anomaly detection developers for optimal outcomes with respect to both detection performance and end-user application. To this end, our framework has high transferability to other types of high frequency time-series data and anomaly detection applications. (C) 2019 Elsevier B.V. All rights reserved.
机译:越来越多地使用自动原位传感器来监测河流的水质,从而能够及时识别出意外的价值或趋势。但是,数据会因技术问题引起的异常而感到困惑,因为技术问题导致数据量和速度无法进行手动检测。我们提供了一个从现场传感器获取的高频水质数据中自动异常检测的框架,它使用了从流入大堡礁的河流中收集的浊度,电导率和河流水位数据。在确定最终用户需求并定义异常后,我们对异常重要性进行了排名,并选择了合适的检测方法。高优先级异常包括突然的孤立尖峰和电平移动,其中大多数通过基于回归的方法(例如自回归综合移动平均模型)正确分类。但是,由于变量之间的复杂关系,将多个水质变量作为协变量并入会降低性能。当我们应用缓解措施时,漂移的分类和异常低或高变异期的分类通常是正确的,它用预测值代替了异常测量值以进行进一步的预测,但是虚假阳性率高涨了。基于特征的方法在高优先级异常上也表现良好,并且在检测低优先级异常上同样不熟练,从而导致较高的假阴性率。但是,与基于回归的方法不同,所有基于特征的方法产生的假阳性率均较低,并且具有无需培训或优化的优势。基于规则的方法成功地检测了优先级较低的子集,特别是不可能的值和缺少的观察值。因此,我们建议一种方法的组合将在正确的异常检测方面提供最佳性能,同时将错误检测率降到最低。此外,我们的框架强调了最终用户与异常检测开发人员之间进行通信的重要性,以实现有关检测性能和最终用户应用程序的最佳结果。为此,我们的框架对其他类型的高频时间序列数据和异常检测应用程序具有高度的可移植性。 (C)2019 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《The Science of the Total Environment》 |2019年第10期|885-898|共14页
  • 作者单位

    ARC Ctr Excellence Math & Stat Frontiers ACEMS, Melbourne, Vic, Australia|Queensland Univ Technol, Inst Future Environm, Brisbane, Qld, Australia|Queensland Univ Technol, Sch Math Sci, Fac Sci & Engn, Brisbane, Qld, Australia;

    ARC Ctr Excellence Math & Stat Frontiers ACEMS, Melbourne, Vic, Australia|Queensland Univ Technol, Inst Future Environm, Brisbane, Qld, Australia;

    ARC Ctr Excellence Math & Stat Frontiers ACEMS, Melbourne, Vic, Australia|Monash Univ, Dept Econometr & Business Stat, Clayton, Vic, Australia;

    ARC Ctr Excellence Math & Stat Frontiers ACEMS, Melbourne, Vic, Australia|Monash Univ, Dept Econometr & Business Stat, Clayton, Vic, Australia;

    Dept Environm & Sci, Water Qual & Invest, Dutton Pk, Qld, Australia;

    ARC Ctr Excellence Math & Stat Frontiers ACEMS, Melbourne, Vic, Australia|Queensland Univ Technol, Sch Math Sci, Fac Sci & Engn, Brisbane, Qld, Australia;

    Dept Environm & Sci, Water Qual & Invest, Dutton Pk, Qld, Australia;

    Dept Environm & Sci, Water Qual & Invest, Dutton Pk, Qld, Australia;

    ARC Ctr Excellence Math & Stat Frontiers ACEMS, Melbourne, Vic, Australia|Monash Univ, Dept Econometr & Business Stat, Clayton, Vic, Australia;

    Dept Environm & Sci, Water Qual & Invest, Dutton Pk, Qld, Australia;

    ARC Ctr Excellence Math & Stat Frontiers ACEMS, Melbourne, Vic, Australia|Queensland Univ Technol, Sch Math Sci, Fac Sci & Engn, Brisbane, Qld, Australia;

    ARC Ctr Excellence Math & Stat Frontiers ACEMS, Melbourne, Vic, Australia|Queensland Univ Technol, Inst Future Environm, Brisbane, Qld, Australia|Queensland Univ Technol, Sch Math Sci, Fac Sci & Engn, Brisbane, Qld, Australia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Big data; Forecasting; Near-real time; Quality control and assurance; River; Time series;

    机译:大数据;预测;近实时;质量控制与保证;河;时间序列;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号