首页> 外文期刊>IFAC PapersOnLine >Inverse Reinforcement Learning for Identifcation in Linear-Quadratic Dynamic Games
【24h】

Inverse Reinforcement Learning for Identifcation in Linear-Quadratic Dynamic Games

机译:用于线性二次动态博弈的辨识的逆强化学习

获取原文
           

摘要

The theory of dynamic games has received considerable attention in a wide range of felds. While great efort has been made to develop new algorithms for fnding Nash equilibria in dynamic games, the identifcation of cost functions has received little attention. We present an identifcation algorithm for linear quadratic dynamic games, a framework which can be applied in the feld of shared control between a human and an automatic controller. In this application, the cost function describing human behavior is identifed, taking into account the infuence of the automation. Furthermore, we consider that human movement underlies certain variability by using a probabilistic Inverse Reinforcement Learning approach. As identifcation is performed in a single optimization step, the proposed method is suited for real-time applications. A simulation example shows that the algorithm successfully identifes the cost function of the frst player which—in combination with the second player—reproduces the observed system output.
机译:动态博弈理论在广泛的领域中受到了相当大的关注。尽管已经竭尽全力开发用于在动态游戏中寻找纳什均衡的新算法,但是成本函数的识别却很少受到关注。我们提出了一种线性二次动态博弈的识别算法,该框架可以应用于人类和自动控制器之间的共享控制。在此应用程序中,考虑到自动化的影响,确定了描述人类行为的成本函数。此外,我们认为通过使用概率逆向强化学习方法,人类的运动是某些可变性的基础。由于识别是在单个优化步骤中执行的,因此该方法适用于实时应用。一个仿真示例表明,该算法成功地确定了第一个播放器的成本函数,该成本函数与第二个播放器结合使用,再现了观察到的系统输出。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号