首页> 外文期刊>IEEE transactions on automation science and engineering >Unsupervised Anomaly Detection Based on Minimum Spanning Tree Approximated Distance Measures and its Application to Hydropower Turbines
【24h】

Unsupervised Anomaly Detection Based on Minimum Spanning Tree Approximated Distance Measures and its Application to Hydropower Turbines

机译:基于最小生成树近似距离措施及其在水电涡轮机的应用无监督异常检测

获取原文
获取原文并翻译 | 示例

摘要

Anomalies are data points or a cluster of data points that lie away from the neighboring points or clusters and are inconsistent with the overall pattern of the data. Anomaly detection techniques help distinguish the anomalous observations from the regular ones, and thus provide the basis for developing a standard performance guideline for process control. The process of identifying anomalies becomes complicated in the absence of labeled training data as in supervised learning. Moreover, Euclidean distance between two points is less likely able to reflect the intrinsic structural distance imposed by the underlying manifold structure. In this paper, the authors propose a minimum spanning tree (MST)-based anomaly detection method. The merit of the method is that an MST provides a new distance measure, capable of capturing the relative connectedness of data points/clusters in a complicated manifold, and could be a better (dis)similarity metric, than the simple Euclidean distance, to identify anomalies in unsupervised learning settings. The proposed method is compared with 13 popular anomaly detection methods on 20 benchmark data sets, demonstrating a considerable improvement in its ability of identifying anomalies. Furthermore, the MST-based anomaly detection is applied to the data set from a hydropower turbine and demonstrates remarkable detection competence.Note to Practitioners-This paper is motivated by the problem of unsupervised anomaly detection in a hydropower generation plant, which operates with turbine systems that are instrumented with dozens of sensors. Each turbine has subcomponents or functional areas such as several bearing systems, a generator, and so on. Sensors collect various types of data in real time such as temperature of oil inside the bearing systems, temperature of the bearings, ambient temperature, vibrations in each functional areas, a variety of harmonics in functional areas, temperature of the coil in the generator, and many more. In total, each turbine collects more than 200 attributes from its sensors. The sensor data are then stored in a control system and kept as time stamped historical data points. When a service/maintenance engineer suspects that there is a malfunction in a turbine, she/he extracts a data set from the control system that contains the collected sensor data for that turbine for the selected period of time (few weeks to few months), and then stores this data in a relational databases or simply in a comma separate value (csv) file for further analysis. The objective is to efficiently identify and isolate anomalies in the turbines. Toward this goal, we propose a new solution for tackling this challenging problem, which is an unsupervised method based on the concept of MST. The proposed method can be used as a competitive tool to aid the practitioners in their search of anomalies for making their systems better.
机译:异常是远离相邻点或集群的数据点或数据点集群,并且与数据的整体模式不一致。异常检测技术有助于区分常规观察的异常观察,从而为开发用于过程控制的标准性能指南提供基础。在没有标记的培训数据的情况下,识别异常的过程变得复杂于监督学习中。此外,两点之间的欧几里德距离不太可能能够反映由下面的歧管结构施加的内在结构距离。在本文中,作者提出了最小的生成树(MST)基础的异常检测方法。该方法的优点是MST提供了新的距离测量,能够在复杂的歧管中捕获数据点/簇的相对连接,并且可以是更好的(DIS)相似度量,而不是简单的欧几里德距离,以识别无监督学习设置中的异常。将所提出的方法与20个基准数据集上的13个普遍的异常检测方法进行比较,展示了识别异常的能力相当大的改进。此外,基于MST基异常检测应用于来自水电涡轮机的数据集,并展示了显着的检测能力。对于从业者而言,本文通过涡轮机系统运行的无监督异常检测问题的激励用数十种传感器仪表。每个涡轮机具有子组件或功能区域,例如若干轴承系统,发电机等。传感器在实时收集各种类型的数据,例如轴承系统内的油温,轴承温度,环境温度,振动在每个功能区域,各种谐波在功能区域,发电机中的线圈温度,和还有很多。总共,每个涡轮机从其传感器收集超过200个属性。然后将传感器数据存储在控制系统中,并保持为时间戳的历史数据点。当服务/维护工程师怀疑涡轮机中存在故障时,她/他从控制系统中提取数据集,其中包含该涡轮机的收集的传感器数据,用于所选时间段(几周到几个月),然后在关系数据库中存储此数据,或者简单地以逗号单独的值(CSV)文件以进行进一步分析。目的是有效地识别和分离涡轮机中的异常。对此目标,我们提出了一种解决这一具有挑战性问题的新解决方案,这是一种基于MST概念的无人监督方法。该方法可以用作竞争工具,以帮助从业者寻求异常,以便更好地制作系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号