...
首页> 外文期刊>Applied Energy >Intelligent multi-zone residential HVAC control strategy based on deep reinforcement learning
【24h】

Intelligent multi-zone residential HVAC control strategy based on deep reinforcement learning

机译:基于深度加强学习的智能多区住宅HVAC控制策略

获取原文
获取原文并翻译 | 示例
           

摘要

Residential heating, ventilation, and air conditioning (HVAC) has been considered as an important demand response resource. However, the optimization of residential HVAC control is no trivial task due to the complexity of the thermal dynamic models of buildings and uncertainty associated with both occupant-driven heat loads and weather forecasts. In this paper, we apply a novel model-free deep reinforcement learning (RL) method, known as the deep deterministic policy gradient (DDPG), to generate an optimal control strategy for a multi-zone residential HVAC system with the goal of minimizing energy consumption cost while maintaining the users' comfort. The applied deep RL-based method learns through continuous interaction with a simulated building environment and without referring to any prior model knowledge. Simulation results show that compared with the state-of-art deep Q network (DQN), the DDPG-based HVAC control strategy can reduce the energy consumption cost by 15% and reduce the comfort violation by 79%; and when compared with a rule-based HVAC control strategy, the comfort violation can be reduced by 98%. In addition, experiments with different building models and retail price models demonstrate that the well-trained DDPG-based HVAC control strategy has high generalization and adaptability to unseen environments, which indicates its practicability for real-world implementation.
机译:住宅加热,通风和空调(HVAC)被认为是重要的需求响应资源。然而,由于与乘员驱动的热负荷和天气预报相关的建筑物的热动力学模型和不确定性的复杂性,优化住宅HVAC控制的优化是没有琐碎的任务。在本文中,我们应用了一种新的无模型深度加强学习(RL)方法,称为深度确定性政策梯度(DDPG),为多区住宅HVAC系统产生最佳控制策略,其目标是最小化能量的目标消费成本,同时保持用户的舒适度。应用的深度RL的方法通过与模拟建筑环境的连续交互以及不参考任何先前的模型知识来学习。仿真结果表明,与最先进的深度Q网络(DQN)相比,基于DDPG的HVAC控制策略可以将能源消耗降低15%,并将舒适性违规降低79%;与基于规则的HVAC控制策略相比,舒适性违规可以减少98%。此外,不同建筑模型和零售价格模型的实验表明,训练有素的基于DDPG的HVAC控制策略具有高泛化和对看不见的环境的适应性,这表明其现实世界实施的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号