首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >A multi-agent reinforcement learning algorithm with fuzzy approximation for Distributed Stochastic Unit Commitment
【24h】

A multi-agent reinforcement learning algorithm with fuzzy approximation for Distributed Stochastic Unit Commitment

机译:具有分布式随机单位承诺的模糊近似多功能加强学习算法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes a novel multi-agent unit commitment model under Smart Grid (SG) environment to minimize the demand satisfaction error and production cost. This is a distributed solution applicable in non-deterministic environments with stochastic parameters intending to solve Distributed Stochastic Unit Commitment (DSUC) problem. We use multi-agent reinforcement learning (RL) in which agents learn as independent learners to cooperatively satisfy the demand profile. The learning mechanism proceeds using a reward signal, which is given based on the performance of the entire system as well as the impact of the joint action of the agents. The learning agent utilizes a novel multi-agent version of Fuzzy Least Square Policy Iteration (FLSPI) as a model-free RL algorithm to approximate Q-function. Based on this approximation, the agent makes the best decision to achieve the goals while considering the constraints governing the system. Uncertainty sources in our definition of the problem are fluctuations in the predicted demand function, random productions of clean energy generators and the possibility of accidental failure in power generators. Training for one time interval (i.e. one season or one year) consisting of several time intervals (i.e. days) can be simultaneously conducted by one trial in our method. We have conducted our experiment in two different frameworks. These frameworks are defined based on the problem complexity in terms of the number of generators, the uncertainties in the environment and the system constraints. The results show that the learning agent learns to satisfy the demand profile as well as other constrains.
机译:本文提出了智能电网(SG)环境下的新型多功能单位承诺模型,以最大限度地减少需求满意度误差和生产成本。这是一种可应用于非确定性环境的分布式解决方案,随机参数打算解决分布式随机单位承诺(DSUC)问题。我们使用多功能钢筋学习(RL),其中代理商作为独立学习者学习,以合作满足需求概况。学习机制使用奖励信号进行,该信号基于整个系统的性能以及代理的联合动作的影响。学习代理利用一个新的多代理版本的模糊最小二乘策略迭代(FLSPI)作为无模型RL算法,以近似Q函数。基于此近似,代理在考虑管理系统的约束时实现目标的最佳决定。在我们对问题定义中的不确定性来源是预测需求功能的波动,清洁能源发生器的随机制作以及发电机中意外失效的可能性。由多个时间间隔(即,天)组成的一个时间间隔(即一个季节或一年)培训可以在我们的方法中同时进行一次试验。我们在两个不同的框架中进行了我们的实验。这些框架是基于在生成器数量的问题复杂性,环境中的不确定性和系统约束方面来定义的。结果表明,学习代理学习以满足需求配置文件以及其他约束。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号