首页> 外文期刊>IEEE transactions on automation science and engineering >Unsupervised Anomaly Detection Based on Minimum Spanning Tree Approximated Distance Measures and its Application to Hydropower Turbines
【24h】

Unsupervised Anomaly Detection Based on Minimum Spanning Tree Approximated Distance Measures and its Application to Hydropower Turbines

机译:基于最小生成树近似距离测度的无监督异常检测及其在水轮机中的应用

获取原文
获取原文并翻译 | 示例

摘要

Anomalies are data points or a cluster of data points that lie away from the neighboring points or clusters and are inconsistent with the overall pattern of the data. Anomaly detection techniques help distinguish the anomalous observations from the regular ones, and thus provide the basis for developing a standard performance guideline for process control. The process of identifying anomalies becomes complicated in the absence of labeled training data as in supervised learning. Moreover, Euclidean distance between two points is less likely able to reflect the intrinsic structural distance imposed by the underlying manifold structure. In this paper, the authors propose a minimum spanning tree (MST)-based anomaly detection method. The merit of the method is that an MST provides a new distance measure, capable of capturing the relative connectedness of data points/clusters in a complicated manifold, and could be a better (dis)similarity metric, than the simple Euclidean distance, to identify anomalies in unsupervised learning settings. The proposed method is compared with 13 popular anomaly detection methods on 20 benchmark data sets, demonstrating a considerable improvement in its ability of identifying anomalies. Furthermore, the MST-based anomaly detection is applied to the data set from a hydropower turbine and demonstrates remarkable detection competence.Note to Practitioners-This paper is motivated by the problem of unsupervised anomaly detection in a hydropower generation plant, which operates with turbine systems that are instrumented with dozens of sensors. Each turbine has subcomponents or functional areas such as several bearing systems, a generator, and so on. Sensors collect various types of data in real time such as temperature of oil inside the bearing systems, temperature of the bearings, ambient temperature, vibrations in each functional areas, a variety of harmonics in functional areas, temperature of the coil in the generator, and many more. In total, each turbine collects more than 200 attributes from its sensors. The sensor data are then stored in a control system and kept as time stamped historical data points. When a service/maintenance engineer suspects that there is a malfunction in a turbine, she/he extracts a data set from the control system that contains the collected sensor data for that turbine for the selected period of time (few weeks to few months), and then stores this data in a relational databases or simply in a comma separate value (csv) file for further analysis. The objective is to efficiently identify and isolate anomalies in the turbines. Toward this goal, we propose a new solution for tackling this challenging problem, which is an unsupervised method based on the concept of MST. The proposed method can be used as a competitive tool to aid the practitioners in their search of anomalies for making their systems better.
机译:异常是远离相邻点或群集的数据点或一组数据点,并且与数据的整体模式不一致。异常检测技术有助于将异常观察与常规观察区分开,从而为制定过程控制的标准性能准则提供了基础。在缺乏标记的训练数据的情况下,如在监督学习中一样,识别异常的过程变得复杂。而且,两点之间的欧几里得距离不太可能反映由下层歧管结构施加的固有结构距离。在本文中,作者提出了一种基于最小生成树(MST)的异常检测方法。该方法的优点在于,MST提供了一种新的距离度量,能够捕获复杂流形中数据点/簇的相对连接性,并且比简单的欧几里得距离可以作为更好的(不相似)度量来识别无监督学习设置中的异常。将该方法与20种基准数据集上的13种流行异常检测方法进行了比较,证明了其识别异常能力的显着提高。此外,基于MST的异常检测已应用于水电涡轮机的数据集,并显示出卓越的检测能力。从业者注意-本文受到水轮机运行的水电厂无监督异常检测问题的启发装有数十个传感器。每个涡轮机都有子组件或功能区域,例如几个轴承系统,一个发电机等。传感器实时收集各种类型的数据,例如轴承系统内部的油温,轴承温度,环境温度,每个功能区域的振动,功能区域的各种谐波,发电机线圈的温度以及还有很多。每个涡轮机总共从其传感器收集200多个属性。然后将传感器数据存储在控制系统中,并作为带有时间戳的历史数据点保存。当服务/维护工程师怀疑涡轮机出现故障时,她/他从控制系统中提取一个数据集,其中包含在选定的时间段(几周到几个月)内收集到的该涡轮机的传感器数据,然后将此数据存储在关系数据库中,或仅存储在逗号分隔值(csv)文件中以进行进一步分析。目的是有效地识别和隔离涡轮中的异常。为了实现这一目标,我们提出了一种新的解决方案来解决这一具有挑战性的问题,这是一种基于MST概念的无监督方法。所提出的方法可以用作竞争工具,以帮助从业人员寻找异常情况,以使他们的系统更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号