首页> 外文会议>2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference >Hydrological Time Series Anomaly Pattern Detection based on Isolation Forest
【24h】

Hydrological Time Series Anomaly Pattern Detection based on Isolation Forest

机译:基于孤立森林的水文时间序列异常模式检测

获取原文
获取原文并翻译 | 示例

摘要

Anomaly detection is one of the basic problems in the field of data mining. It has been concerned and widely used by industry and academia. With the continuous increase of hydrological data, the current anomaly detection algorithm is too low in time efficiency. Besides, there are too many anomaly points excavated. In the face of so many anomaly points, the analysis decision-makers have no way to start. For this problem, this paper uses the isolation forest algorithm for the hydrological data of the pattern representation, and a hydrological time series anomaly pattern detection algorithm based on isolation forest is proposed. At the same time, it is difficult to determine the partition threshold in isolation forest and can not output top-k. K-means clustering respectively algorithm and nearest neighbor algorithm are used to improve the isolation forest algorithm, which can effectively overcome the subjectivity of artificially setting threshold and improve the stability of results expression. It is applied to the measured data of the Chuhe River Basin, and compared with other improved algorithms in accuracy and time complexity. The effectiveness of the improved isolation forest algorithm is verified by experiments.
机译:异常检测是数据挖掘领域的基本问题之一。它已被工业界和学术界所关注并广泛使用。随着水文数据的不断增加,目前的异常检测算法的时间效率太低。此外,挖掘的异常点太多。面对如此多的异常点,分析决策者无从下手。针对该问题,本文将隔离森林算法用于模式表示的水文数据,提出了一种基于隔离森林的水文时间序列异常模式检测算法。同时,很难确定隔离林中的分区阈值,并且无法输出top-k。采用K均值聚类算法和最近邻算法对隔离森林算法进行改进,可以有效克服人工设置阈值的主观性,提高结果表达的稳定性。将其应用于the河流域实测数据,并与其他改进算法在准确性和时间复杂度上进行比较。实验证明了改进后的隔离林算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号