Generating Generic Data Sets for Machine Learning Applications in Building Services Using Standardized Time Series Data

机译：使用标准化时间序列数据生成用于构建服务中的机器学习应用程序的通用数据集

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Machine Learning Algorithms (ML) offer a high potential with low manual effort to discover appropriate energy efficiency measures for buildings. Although many building automation systems (BAS) record a high amount of data, technical systems such as boilers provide only a few data points per building. However, machine-learning algorithms require training based on a sufficient number of instances of a technical system in order to enable cross-building use. In contrast to electrical systems, few data sets of actual operation of thermal systems are publicly available. Since 2012, the monitoring system in our test object has continuously provided threshold-based data with a maximum resolution of 1 minute. We monitor the plants, energy consumption and comfort parameters with 9239 data points in total. In this paper, we show how our published data set from this building is structured. In order to facilitate the use of ML, each data point receives a uniform label according to a previously developed approach. Since the documentation of ML data sets varies in the building sector, we show an approach to standardize data sets with special datasheets for thermal systems to provide sufficient information for application of ML. We use the Brick Schema, a unified ontology standard for the description of topology in buildings, which is part of the future ASHRAE Standard 223P. We couple this with an approach we developed for the structured labeling of data points in buildings. We show how to semi-automatically generate physical models based on an open-source Modelica library from this ontology-based model. We show that the models, enriched with real time series data and data sheets, are in good agreement with the measured data. Finally, we show with an ML example that our approach based on Brick Schema and Modelica is able to deliver ML compliant data sets.

机译：机器学习算法（ML）提供高潜力，手动努力，以发现建筑物的适当能效措施。虽然许多楼宇自动化系统（BAS）记录了大量数据，但锅炉等技术系统只提供了每栋建筑的几个数据点。然而，机器学习算法需要基于足够数量的技术系统实例进行训练，以便能够实现交叉建设。与电气系统相比，很少有数据的热系统的实际操作集是公开可用的。自2012年以来，我们的测试对象中的监控系统连续提供基于阈值的数据，最大分辨率为1分钟。我们总共监控植物，能耗和舒适参数，总共有9239个数据点。在本文中，我们展示了我们从该建筑物中的已发布的数据如何构建。为了便于使用M1，每个数据点根据先前开发的方法接收均匀的标签。由于ML数据集的文档在建筑物扇区中变化，因此我们显示了一种用特殊数据集标准化数据集的方法，用于热系统，以提供ML的施加足够的信息。我们使用Brick Schema，一个统一的本体标准标准，用于建筑物中的拓扑描述，这是未来ASHRAE标准223P的一部分。我们将其与我们为建筑物中的数据点的结构化标签开发的方法进行了解决方法。我们展示了如何基于此基于本体的模型的开源Modelica库进行半自动生成物理模型。我们表明，与实时序列数据和数据表丰富的模型与测量数据很好。最后，我们展示了ML示例，即我们基于Brick Schema和Modelica的方法能够提供符合符合标准的数据集。

著录项

来源
《International Symposium on Automation and Robotics in Construction and Mining》|2019年|664p|共8页
会议地点
作者
F. Stinner; Y. Yang; T. Schreiber; G. Bode; M. Baranski; D. Muller;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP2-53;
关键词
Standardized data sets; Machine learning; Simulation; Modelica; Building energy systems;

机译：标准化数据集;机器学习;模拟;Modelica;建筑能源系统;
入库时间 2022-08-20 20:20:12

相似文献

外文文献
中文文献
专利

1. Making machine-learning applications for time-series sensor data graphical and interactive [J] . Kim Seungjun, Tasse Dan, Dey Anind K. ACM Transactions on Interactive Intelligent Systems . 2017,第2期

机译：为时序传感器数据的图形化和交互化提供机器学习应用程序
2. Time series prediction by a neural network model based on Bi-directional computation style: a study on generalization performance with the computer-generated time series "data set D" [J] . Hiroshi Wakuya, Katsunori Shida Systems and Computers in Japan . 2003,第10期

机译：基于双向计算方式的神经网络模型进行时间序列预测：计算机生成的时间序列“数据集D”的泛化性能研究
3. Combining data assimilation and machine learning to build data-driven models for unknown long time dynamics-Applications in cardiovascular modeling [J] . Regazzoni Francesco, Chapelle Dominique, Moireau Philippe International journal for numerical methods in biomedical engineering . 2021,第7期

机译：结合数据同化和机器学习，构建数据驱动模型，以便在心血管建模中构建未知的长时间动力学应用程序
4. Generating Generic Data Sets for Machine Learning Applications in Building Services Using Standardized Time Series Data [C] . F. Stinner, Y. Yang, T. Schreiber, International Symposium on Automation and Robotics in Construction and Mining . 2019

机译：使用标准化时间序列数据生成用于构建服务中的机器学习应用程序的通用数据集
5. A machine learning approach to query time-series microarray data sets for functionally related genes using hidden Markov models. [D] . Senf, Alexander. 2011

机译：一种使用隐马尔可夫模型查询功能相关基因的时间序列微阵列数据集的机器学习方法。
6. ProteinNet: a standardized data set for machine learning of protein structure [O] . Mohammed AlQuraishi 2019

机译：ProteinNet：用于蛋白质结构机器学习的标准化数据集
7. The application of machine learning techniques to time-series data [O] . Mitchell R. Scott 1995

机译：机器学习技术在时间序列数据中的应用

Generating Generic Data Sets for Machine Learning Applications in Building Services Using Standardized Time Series Data

摘要

著录项

相似文献

相关主题

期刊订阅