Synergizing reinforcement learning and game theory - A new direction for control

Rajneesh Sharma; M. Gopal


Abstract

Reinforcement learning (RL) has evolved into a major technique for adaptive optimal control of nonlinear systems. However, the majority of the RL algorithms proposed so far impose a strong constraint on the structure of the environment dynamics by assuming that the environment operates as a Markov decision process (MDP). The MDP framework envisages a single agent operating in a stationary environment, thereby limiting the scope of application of RL to control problems. Recently, a new direction of research has proposed Markov games as an alternative system model to enhance the generality and robustness of RL-based approaches. This paper presents this new direction, which seeks to synergize the broad areas of RL and game theory, as an interesting and challenging avenue for designing intelligent and reliable controllers. First, we briefly review some representative RL algorithms for the sake of completeness, and then describe the recent direction that seeks to integrate RL and game theory. Finally, open issues are identified and future research directions are outlined.
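
The contrast the abstract draws between the MDP model and the Markov game model can be illustrated with a minimal sketch. It is not taken from the paper: the function names and the Q-values are hypothetical, and the game value is restricted to pure strategies for brevity, whereas Markov game methods generally allow mixed strategies. It only shows how a single-agent value (a plain maximum) differs from a worst-case value when a second player, such as a disturbance, also acts.

```python
from typing import Sequence


def mdp_state_value(q: Sequence[float]) -> float:
    """Single-agent (MDP) view: the controller simply picks its best action."""
    return max(q)


def markov_game_state_value(q: Sequence[Sequence[float]]) -> float:
    """Two-player zero-sum view, restricted to pure strategies (an assumption).

    q[a][o] is the payoff to the controller when it plays action a and the
    opponent (for example a disturbance) plays action o. The controller
    maximizes its worst-case payoff over the opponent's actions.
    """
    return max(min(row) for row in q)


# Hypothetical Q-values for one state (illustrative numbers only).
# Rows: controller actions. Columns: opponent/disturbance actions.
q_game = [
    [4.0, -2.0],  # action 0: high payoff, but poor under disturbance 1
    [1.0, 0.5],   # action 1: modest payoff, robust to both disturbances
]

# An MDP-style learner that implicitly assumes the benign disturbance (column 0).
q_mdp = [row[0] for row in q_game]

v_mdp = mdp_state_value(q_mdp)             # optimistic value, prefers action 0
v_game = markov_game_state_value(q_game)   # worst-case value, prefers action 1

print(v_mdp, v_game)
```

In this toy state the single-agent value favors the action that performs well only under the assumed disturbance, while the game value favors the action with the better guaranteed payoff, which is the sense of robustness the abstract refers to.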


Bibliographic details

Source: Applied Soft Computing, 2010, Issue 3, 14 pages
Authors: Rajneesh Sharma; M. Gopal
Original format: PDF
Language: English
Chinese Library Classification: Computer software
Keywords: Reinforcement learning; Game theory; Markov games; Markov game based RL control


