Control of HVAC-systems with Slow Thermodynamic Using Reinforcement Learning

C. Blad; S. Koch; S. Ganeswarathas; C.S. Kalles?e; S. B?gh

首页> 外文期刊>Procedia Manufacturing >Control of HVAC-systems with Slow Thermodynamic Using Reinforcement Learning

【24h】

Control of HVAC-systems with Slow Thermodynamic Using Reinforcement Learning

机译：使用强化学习的热力学慢的HVAC系统控制

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes an adaptive controller based on Reinforcement Learning (RL), which copes with HVAC-systems consisting of slow thermodynamics. Two different RL algorithms with Q-Networks (QNs) are investigated. The HVAC-system is in this study an underfloor heating system. Underfloor heating is of great interest because it is very common in Scandinavia, but this research can be applied to a wide range of HVAC-systems, industrial processes and other control applications that are dominated by very slow dynamics. The environments consist of one, two, and four zones within a house in a simulation environment meaning that agents will be exposed to gradually more complex environments separated into test levels. The novelty of this paper is the incorporation of two different RL algorithms for industrial process control; a QN and a QN + Eligibility Trace (QN+ET). The reason for using eligibility trace is that an underfloor heating environment is dominated by slow dynamics and by using eligibility trace the agent can find correlations between the reward and actions taken in earlier iterations.

机译：本文提出了一种基于强化学习（RL）的自适应控制器，该控制器可应对由慢热力学组成的HVAC系统。研究了带有Q网络（QN）的两种不同的RL算法。在本研究中，HVAC系统是地板采暖系统。地板采暖引起了极大的兴趣，因为它在斯堪的纳维亚半岛非常普遍，但是这项研究可以应用于以缓慢的动力学为主导的各种HVAC系统，工业过程和其他控制应用。在模拟环境中，环境由房屋内的一个，两个和四个区域组成，这意味着代理将暴露于逐渐变得更复杂的环境中，这些环境分为测试级别。本文的新颖之处在于将两种不同的RL算法整合到了工业过程控制中。 QN和QN +资格跟踪（QN + ET）。使用资格跟踪的原因是，地板采暖环境主要由缓慢的动力学控制，通过使用资格跟踪，代理可以找到奖励和早期迭代中采取的措施之间的相关性。

著录项

来源
《Procedia Manufacturing》 |2019年第2015期|共8页
作者
C. Blad; S. Koch; S. Ganeswarathas; C.S. Kalles?e; S. B?gh;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类工业建设与发展;
关键词
Sustainable Manufacturing EngineeringResource-Efficient ProductionArtificial Intelligence in ManufacturingModellingSimulationHVAC-Systems;

机译：可持续制造工程制造中的资源高效生产人工智能建模模拟HVAC系统;

相似文献

外文文献
中文文献
专利

1. Control of HVAC-systems with Slow Thermodynamic Using Reinforcement Learning [J] . C. Blad, S. Koch, S. Ganeswarathas, Procedia Manufacturing . 2019,第3期

机译：使用强化学习的热力学慢的HVAC系统控制
2. Suboptimal control for nonlinear slow-fast coupled systems using reinforcement learning and Takagi-Sugeno fuzzy methods [J] . Liu Xiaomin, Yang Chunyu, Luo Biao, International Journal of Adaptive Control and Signal Processing . 2021,第6期

机译：使用加固学习和Takagi-Sugeno模糊方法的非线性慢速耦合系统的次优控制
3. Reinforcement Learning Toolbox: Reinforcement Learning for Optimal Control Tasks Institute for Theoretical Computer Science TU-GRAZ [J] . Gerhard Neumann OGAI Journal . 2007,第3期

机译：强化学习工具箱：针对最优控制任务的强化学习理论计算机科学研究院TU-GRAZ
4. Control of HVAC-systems with Slow Thermodynamic Using Reinforcement Learning [C] . C. Blad, S. Koch, S. Ganeswarathas, International Conference on Flexible Automation and Intelligent Manufacturing . 2020

机译：利用加固学习控制具有慢热力学的HVAC系统
5. Deep Learning and Reinforcement Learning for Inventory Control [D] . Khanidahaj, Zahra. 2018

机译：库存控制深度学习和加固学习
6. Reinforcement Learning on Slow Features of High-Dimensional Input Streams [O] . Robert Legenstein, Niko Wilbert, Laurenz Wiskott 2010

机译：高维输入流慢速特征的强化学习
7. Disrupted reinforcement learning during post-error slowing in ADHD [O] . Andre Chevrier, Mehereen Bhaijiwala, Jonathan Lipszyc, 2018

机译：在ADHD后出现错误减速时扰乱了增强学习

Control of HVAC-systems with Slow Thermodynamic Using Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅