首页> 外文会议>International Work Conference on Bio-inspired Intelligence >Impact of target variable distribution type over the regression analysis in wind turbine data
【24h】

Impact of target variable distribution type over the regression analysis in wind turbine data

机译:靶变量分布类型对风力涡轮机数据回归分析的影响

获取原文

摘要

The Wind sector has roughly 2200M euros of profit losses due to wind turbine failures and these failures do not contribute to the goal of reducing greenhouse gas emissions of many states. The 25-35% of the generation costs are operation and maintenance services. To lower this ratio, the wind turbine industry is backing on the Machine Learning techniques over SCADA data. Signal trending analysis supported on linear regression models presents the problem of how to carefully choose the right target variable, which reproduces as close as possible the behavior of a failure from a component. This document evaluates the impact of that choice by comparing as target different variables with discrete-non normal distribution, commonly selected by feature selection methods, versus variables that are continuous over time with a near normal distribution. Experimental results on real data show the use of continuous target variables selected by human expert on the field give better results than the use of targets obtained through feature selection algorithm.
机译:由于风力涡轮机故障,风扇大约有2200万欧元的利润损失,这些故障对减少许多州的温室气体排放的目标没有贡献。 25-35 %的生成成本是运营和维护服务。为了降低这种比率,风力涡轮机行业正在支持SCADA数据的机器学习技术。线性回归模型支持的信号趋势分析呈现了如何仔细选择右目标变量的问题,这些变量尽可能接近组件的故障的行为。本文档通过与离散非正常分布的目标不同变量进行比较来评估该选择的影响,通常由特征选择方法,与近正常分布连续的变量连续。实验结果实验结果实际数据显示使用人类专家选择的连续目标变量在现场提供更好的结果,而不是通过通过特征选择算法获得的目标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号