首页> 外文期刊>Energies >Application of a Gradient Descent Continuous Actor-Critic Algorithm for Double-Side Day-Ahead Electricity Market Modeling
【24h】

Application of a Gradient Descent Continuous Actor-Critic Algorithm for Double-Side Day-Ahead Electricity Market Modeling

机译:梯度下降连续主演算法在双面日前电力市场建模中的应用

获取原文
       

摘要

An important goal of China’s electric power system reform is to create a double-side day-ahead wholesale electricity market in the future, where the suppliers (represented by GenCOs) and demanders (represented by DisCOs) compete simultaneously with each other in one market. Therefore, modeling and simulating the dynamic bidding process and the equilibrium in the double-side day-ahead electricity market scientifically is not only important to some developed countries, but also to China to provide a bidding decision-making tool to help GenCOs and DisCOs obtain more profits in market competition. Meanwhile, it can also provide an economic analysis tool to help government officials design the proper market mechanisms and policies. The traditional dynamic game model and table-based reinforcement learning algorithm have already been employed in the day-ahead electricity market modeling. However, those models are based on some assumptions, such as taking the probability distribution function of market clearing price ( MCP ) and each rival’s bidding strategy as common knowledge (in dynamic game market models), and assuming the discrete state and action sets of every agent (in table-based reinforcement learning market models), which are no longer applicable in a realistic situation. In this paper, a modified reinforcement learning method, called gradient descent continuous Actor-Critic (GDCAC) algorithm was employed in the double-side day-ahead electricity market modeling and simulation. This algorithm can not only get rid of the abovementioned unrealistic assumptions, but also cope with the Markov decision-making process with continuous state and action sets just like the real electricity market. Meanwhile, the time complexity of our proposed model is only O( n ). The simulation result of employing the proposed model in the double-side day-ahead electricity market shows the superiority of our approach in terms of participant’s profit or social welfare compared with traditional reinforcement learning methods.
机译:中国电力系统改革的重要目标是在未来建立双向的日间批发电力市场,供方(由GenCO代表)和需求方(由DisCO代表)在一个市场中同时竞争。因此,科学地模拟和模拟双面日间电力市场中的动态竞价过程和均衡不仅对某些发达国家很重要,而且对于中国提供一个竞价决策工具以帮助GenCO和DisCO获得在市场竞争中获得更多利润。同时,它还可以提供一种经济分析工具,以帮助政府官员设计适当的市场机制和政策。传统的动态博弈模型和基于表的强化学习算法已被用于日前电力市场建模中。但是,这些模型基于一些假设,例如将市场清算价格(MCP)的概率分布函数和每个竞争对手的出价策略作为常识(在动态游戏市场模型中),并假设每个市场的离散状态和行为集代理(在基于表的强化学习市场模型中),在现实情况下不再适用。本文在双面日前电力市场建模和仿真中采用了一种改进的强化学习方法,称为梯度下降连续Actor-Critic(GDCAC)算法。该算法不仅可以摆脱上述不切实际的假设,而且可以像真实的电力市场一样,以连续的状态和动作集来应对马尔可夫决策过程。同时,我们提出的模型的时间复杂度仅为O(n)。在双向日前电力市场中采用该模型的仿真结果表明,与传统的强化学习方法相比,我们的方法在参与者的利润或社会福利方面具有优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号