首页> 外文期刊>Applied Soft Computing >Synergizing reinforcement learning and game theory - A new direction for control
【24h】

Synergizing reinforcement learning and game theory - A new direction for control

机译:协同强化学习与博弈论-控制的新方向

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Reinforcement learning (RL) has now evolved as a major technique for adaptive optimal control of nonlinear systems. However, majority of the RL algorithms proposed so far impose a strong constraint on the structure of environment dynamics by assuming that it operates as a Markov decision process (MDP). An MDP framework envisages a single agent operating in a stationary environment thereby limiting the scope of application of RL to control problems. Recently, a new direction of research has focused on proposing Markov games as an alternative system model to enhance the generality and robustness of the RL based approaches. This paper aims to present this new direction that seeks to synergize broad areas of RL and Game theory, as an interesting and challenging avenue for designing intelligent and reliable controllers. First, we briefly review some representative RL algorithms for the sake of completeness and then describe the recent direction that seeks to integrate RL and game theory. Finally, open issues are identified and future research directions outlined.
机译:强化学习(RL)现在已经发展成为一种用于非线性系统自适应最优控制的主要技术。但是,到目前为止,大多数提出的RL算法通过假设其作为马尔可夫决策过程(MDP)来对环境动力学的结构施加强力约束。 MDP框架设想在固定环境中运行的单个代理程序,从而限制了RL用于控制问题的应用范围。最近,新的研究方向集中在提出马尔可夫博弈作为替代系统模型以增强基于RL的方法的通用性和鲁棒性。本文旨在提出这个新的方向,力求使RL和博弈论的广泛领域协同作用,作为设计智能和可靠控制器的有趣而富挑战性的途径。首先,为了完整起见,我们简要回顾一些代表性的RL算法,然后描述寻求将RL和博弈论相结合的最新方向。最后,确定未解决的问题并概述未来的研究方向。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号