Comparing Policy Gradient and Value Function Based Reinforcement Learning Methods in Simulated Electrical Power Trade

Lincoln R.; Galloway S.; Stephen B.; Burt G.

首页> 外文期刊>Power Systems, IEEE Transactions on >Comparing Policy Gradient and Value Function Based Reinforcement Learning Methods in Simulated Electrical Power Trade

【24h】

Comparing Policy Gradient and Value Function Based Reinforcement Learning Methods in Simulated Electrical Power Trade

机译：模拟电力贸易中基于策略梯度和价值函数的强化学习方法比较

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In electrical power engineering, reinforcement learning algorithms can be used to model the strategies of electricity market participants. However, traditional value function based reinforcement learning algorithms suffer from convergence issues when used with value function approximators. Function approximation is required in this domain to capture the characteristics of the complex and continuous multivariate problem space. The contribution of this paper is the comparison of policy gradient reinforcement learning methods, using artificial neural networks for policy function approximation, with traditional value function based methods in simulations of electricity trade. The methods are compared using an AC optimal power flow based power exchange auction market model and a reference electric power system model.

机译：在电力工程中，强化学习算法可用于对电力市场参与者的策略进行建模。但是，传统的基于价值函数的强化学习算法在与价值函数逼近器配合使用时会遇到收敛问题。在此域中需要函数逼近来捕获复杂且连续的多元问题空间的特征。本文的贡献是将策略梯度强化学习方法与基于传统价值函数方法的电力贸易模拟方法进行了比较，该方法使用人工神经网络进行策略函数逼近。使用基于交流最优潮流的电力交易拍卖市场模型和参考电力系统模型对方法进行比较。

著录项

来源
《Power Systems, IEEE Transactions on》 |2012年第1期|p.373-380|共8页
作者
Lincoln R.; Galloway S.; Stephen B.; Burt G.;
展开▼
作者单位

Department of Electronic and Electrical Engineering, The University of Strathclyde, Glasgow, Scotland;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Artificial intelligence; game theory; gradient methods; learning control systems; neural network applications; power system economics;

机译：人工智能;博弈论;梯度法;学习控制系统;神经网络应用;电力系统经济学;

相似文献

外文文献
中文文献
专利

1. Spike-Based Reinforcement Learning in Continuous State and Action Space: When Policy Gradient Methods Fail [J] . Eleni Vasilaki, Nicolas Frémaux, Robert Urbanczik, PLoS Computational Biology . 2009,第12期

机译：连续状态和动作空间中基于峰值的强化学习：当策略梯度方法失败时
2. A Collaborative Multiagent Reinforcement Learning Method Based on Policy Gradient Potential [J] . Zhen Zhang, Yew-Soon Ong, Dongqing Wang, Cybernetics, IEEE Transactions on . 2021,第2期

机译：一种基于政策梯度潜力的协同多合作加固学习方法
3. Training a robust reinforcement learning controller for the uncertain system based on policy gradient method [J] . Li Zhan, Xue Shengri, Lin Weiyang, Neurocomputing . 2018,第NOVa17期

机译：基于策略梯度法的不确定系统鲁棒强化学习控制器训练
4. Policy gradient reinforcement learning method for discrete-time linear quadratic regulation problem using estimated state value function [C] . Tomotake Sasaki, Eiji Uchibe, Hidenao Iwane, Annual Conference of the Society of Instrument and Control Engineers of Japan . 2017

机译：基于估计状态值函数的离散线性二次调节问题的策略梯度强化学习方法
5. Policy-Aware Model Learning for Policy Gradient Methods [D] . Abachi, Romina . 2020

机译：政策感知模型学习策略梯度方法
6. Correction: Spike-Based Reinforcement Learning in Continuous State and Action Space: When Policy Gradient Methods Fail [O] . Eleni Vasilaki, Nicolas Frémaux, Robert Urbanczik, 2009

机译：更正：在连续状态和动作空间中基于峰值的强化学习：当策略梯度方法失败时
7. Comparing policy gradient and value function based reinforcement learning methods in simulated electrical power trade [O] . Lincoln, Richard, Galloway, Stuart, Stephen, Bruce, 2012

机译：模拟电力贸易中基于策略梯度和价值函数的强化学习方法比较

Comparing Policy Gradient and Value Function Based Reinforcement Learning Methods in Simulated Electrical Power Trade

摘要

著录项

相似文献

相关主题

期刊订阅