首页> 外文会议>International Symposium on Automation and Robotics in Construction and Mining >Generating Generic Data Sets for Machine Learning Applications in Building Services Using Standardized Time Series Data
【24h】

Generating Generic Data Sets for Machine Learning Applications in Building Services Using Standardized Time Series Data

机译:使用标准化时间序列数据生成用于构建服务中的机器学习应用程序的通用数据集

获取原文

摘要

Machine Learning Algorithms (ML) offer a high potential with low manual effort to discover appropriate energy efficiency measures for buildings. Although many building automation systems (BAS) record a high amount of data, technical systems such as boilers provide only a few data points per building. However, machine-learning algorithms require training based on a sufficient number of instances of a technical system in order to enable cross-building use. In contrast to electrical systems, few data sets of actual operation of thermal systems are publicly available. Since 2012, the monitoring system in our test object has continuously provided threshold-based data with a maximum resolution of 1 minute. We monitor the plants, energy consumption and comfort parameters with 9239 data points in total. In this paper, we show how our published data set from this building is structured. In order to facilitate the use of ML, each data point receives a uniform label according to a previously developed approach. Since the documentation of ML data sets varies in the building sector, we show an approach to standardize data sets with special datasheets for thermal systems to provide sufficient information for application of ML. We use the Brick Schema, a unified ontology standard for the description of topology in buildings, which is part of the future ASHRAE Standard 223P. We couple this with an approach we developed for the structured labeling of data points in buildings. We show how to semi-automatically generate physical models based on an open-source Modelica library from this ontology-based model. We show that the models, enriched with real time series data and data sheets, are in good agreement with the measured data. Finally, we show with an ML example that our approach based on Brick Schema and Modelica is able to deliver ML compliant data sets.
机译:机器学习算法(ML)提供高潜力,手动努力,以发现建筑物的适当能效措施。虽然许多楼宇自动化系统(BAS)记录了大量数据,但锅炉等技术系统只提供了每栋建筑的几个数据点。然而,机器学习算法需要基于足够数量的技术系统实例进行训练,以便能够实现交叉建设。与电气系统相比,很少有数据的热系统的实际操作集是公开可用的。自2012年以来,我们的测试对象中的监控系统连续提供基于阈值的数据,最大分辨率为1分钟。我们总共监控植物,能耗和舒适参数,总共有9239个数据点。在本文中,我们展示了我们从该建筑物中的已发布的数据如何构建。为了便于使用M1,每个数据点根据先前开发的方法接收均匀的标签。由于ML数据集的文档在建筑物扇区中变化,因此我们显示了一种用特殊数据集标准化数据集的方法,用于热系统,以提供ML的施加足够的信息。我们使用Brick Schema,一个统一的本体标准标准,用于建筑物中的拓扑描述,这是未来ASHRAE标准223P的一部分。我们将其与我们为建筑物中的数据点的结构化标签开发的方法进行了解决方法。我们展示了如何基于此基于本体的模型的开源Modelica库进行半自动生成物理模型。我们表明,与实时序列数据和数据表丰富的模型与测量数据很好。最后,我们展示了ML示例,即我们基于Brick Schema和Modelica的方法能够提供符合符合标准的数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号