首页> 外文期刊>Smart Grid, IEEE Transactions on >Constrained EV Charging Scheduling Based on Safe Deep Reinforcement Learning
【24h】

Constrained EV Charging Scheduling Based on Safe Deep Reinforcement Learning

机译:基于安全深度加强学习的受限EV充电调度

获取原文
获取原文并翻译 | 示例
           

摘要

Electric vehicles (EVs) have been popularly adopted and deployed over the past few years because they are environment-friendly. When integrated into smart grids, EVs can operate as flexible loads or energy storage devices to participate in demand response (DR). By taking advantage of time-varying electricity prices in DR, the charging cost can be reduced by optimizing the charging/discharging schedules. However, since there exists randomness in the arrival and departure time of an EV and the electricity price, it is difficult to determine the optimal charging/discharging schedules to guarantee that the EV is fully charged upon departure. To address this issue, we formulate the EV charging/discharging scheduling problem as a constrained Markov Decision Process (CMDP). The aim is to find a constrained charging/discharging scheduling strategy to minimize the charging cost as well as guarantee the EV can be fully charged. To solve the CMDP, a model-free approach based on safe deep reinforcement learning (SDRL) is proposed. The proposed approach does not require any domain knowledge about the randomness. It directly learns to generate the constrained optimal charging/discharging schedules with a deep neural network (DNN). Unlike existing reinforcement learning (RL) or deep RL (DRL) paradigms, the proposed approach does not need to manually design a penalty term or tune a penalty coefficient. Numerical experiments with real-world electricity prices demonstrate the effectiveness of the proposed approach.
机译:电动汽车(EVS)在过去几年中受到普遍采用和部署,因为它们是环保的。当集成到智能网格中时,EVS可以作为灵活的负载或能量存储设备操作,以参与需求响应(DR)。通过利用DR中的时变电价,通过优化充电/放电时间表,可以降低充电成本。然而,由于EV的到来和出发时间存在随机性和电价,因此难以确定最佳充电/放电时间表,以保证在出发时完全收取EV。要解决此问题,我们将EV充电/放电调度问题标记为约束的Markov决策过程(CMDP)。目的是找到一个受限制的充电/放电调度策略,以最大限度地降低充电成本以及保证EV可以充分充电。为了解决CMDP,提出了一种基于安全深增强学习(SDRL)的无模型方法。所提出的方法不需要任何关于随机性的域名知识。它直接学习用深神经网络(DNN)生成受约束的最佳充电/放电计划。与现有的增强学习(RL)或Deep RL(DRL)范例不同,所提出的方法不需要手动设计罚款或调整惩罚系数。现实世界电价的数值实验表明了提出的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号