Inverse Reinforcement Learning for Identification in Linear-Quadratic Dynamic Games

Florian Köpf; Jairo Inga; Simon Rothfuß; Michael Flad; Sören Hohmann

Abstract

The theory of dynamic games has received considerable attention in a wide range of fields. While great effort has been made to develop new algorithms for finding Nash equilibria in dynamic games, the identification of cost functions has received little attention. We present an identification algorithm for linear-quadratic dynamic games, a framework which can be applied in the field of shared control between a human and an automatic controller. In this application, the cost function describing human behavior is identified, taking into account the influence of the automation. Furthermore, we account for the variability inherent in human movement by using a probabilistic Inverse Reinforcement Learning approach. As identification is performed in a single optimization step, the proposed method is suited for real-time applications. A simulation example shows that the algorithm successfully identifies the cost function of the first player, which, in combination with the second player, reproduces the observed system output.

Bibliographic details

Source
IFAC PapersOnLine | 2017, Issue 1 | 7 pages
Authors
Florian Köpf; Jairo Inga; Simon Rothfuß; Michael Flad; Sören Hohmann
Original format: PDF
Chinese Library Classification: Automation technology, computer technology
