Experimental evaluation of model-free reinforcement learning algorithms for continuous HVAC control

Biemann Marco; Scheller Fabian; Liu Xiufeng; Huang Lizhen

Abstract

Controlling heating, ventilation and air-conditioning (HVAC) systems is crucial to improving demand-side energy efficiency. At the same time, the thermodynamics of buildings and uncertainties regarding human activities make effective management challenging. While model-free reinforcement learning demonstrates various advantages over existing strategies, the literature relies heavily on value-based methods that can hardly handle complex HVAC systems. This paper conducts experiments to evaluate four actor-critic algorithms in a simulated data centre. The performance evaluation is based on their ability to maintain thermal stability while increasing energy efficiency and on their adaptability to weather dynamics. Because data efficiency is highly significant for practical use, special attention is paid to it. Compared to the model-based controller implemented in EnergyPlus, all applied algorithms can reduce energy consumption by at least 10% while simultaneously keeping the hourly average temperature in the desired range. Robustness tests with different reward functions and weather conditions verify these results. With increasing training, we also see a smaller trade-off between thermal stability and energy reduction. Thus, the Soft Actor Critic algorithm achieves stable performance with ten times less data than on-policy methods. In this regard, we recommend using this algorithm in future experiments, due to both its interesting theoretical properties and its practical results.
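
The trade-off between thermal stability and energy reduction mentioned in the abstract is typically encoded in the reward signal. The following is a minimal sketch of one such reward, not the reward function used in the paper: the function name `hvac_reward`, the temperature band, and both weights are hypothetical values chosen only for illustration.

```python
def hvac_reward(zone_temp_c, power_kw,
                temp_low_c=18.0, temp_high_c=27.0,
                energy_weight=0.01, comfort_weight=1.0):
    """Hypothetical reward: penalise power draw and leaving a temperature band.

    All defaults are illustrative assumptions, not values from the paper.
    """
    # Degrees outside the band [temp_low_c, temp_high_c]; zero when inside it.
    violation = max(temp_low_c - zone_temp_c, zone_temp_c - temp_high_c, 0.0)
    # Weighted sum of both penalties, negated so that larger is better.
    return -(energy_weight * power_kw + comfort_weight * violation)


if __name__ == "__main__":
    print(hvac_reward(22.0, 100.0))  # inside the band: only the energy term
    print(hvac_reward(29.0, 100.0))  # 2 degrees above the band: extra penalty
```

Changing `energy_weight` relative to `comfort_weight` shifts the balance between the two objectives, which is the kind of variation a robustness test over different reward functions would explore.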

Bibliographic details

Source
Applied Energy | 2021, Issue 15 | 117164.1-117164.18 | 18 pages
Authors
Biemann Marco; Scheller Fabian; Liu Xiufeng; Huang Lizhen;
Author affiliations

Tech Univ Denmark Dept Technol Management & Econ DK-2800 Lyngby Denmark|Norwegian Univ Sci & Technol Dept Mfg & Civil Engn N-2815 Gjovik Norway;

Tech Univ Denmark Dept Technol Management & Econ DK-2800 Lyngby Denmark;

Tech Univ Denmark Dept Technol Management & Econ DK-2800 Lyngby Denmark;

Norwegian Univ Sci & Technol Dept Mfg & Civil Engn N-2815 Gjovik Norway;

Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Text language: English
Keywords
Reinforcement learning; Continuous HVAC control; Actor-critic algorithms; Robustness; Energy efficiency; Soft Actor Critic;
