Optimal tracking control for non-zero-sum games of linear discrete-time systems via off-policy reinforcement learning

Abstract

This article applies a model-free off-policy reinforcement learning algorithm to the optimal tracking problem for multiplayer non-zero-sum games of discrete-time linear systems. Unlike the traditional method and the policy iteration method for solving optimal tracking problems, the proposed algorithm uses measured system data rather than knowledge of the system dynamics. To implement the algorithm, an auxiliary augmented system is constructed by combining the original system with the reference trajectory, and a discount factor is introduced into the performance indexes. Analysis shows that the solutions of the proposed algorithm converge to the Nash equilibrium and that the result is not influenced by the probing noise. Two simulations verify the feasibility and effectiveness of the proposed algorithm.
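
The augmented-system and discount-factor ideas mentioned in the abstract can be illustrated with a minimal sketch. It is not the article's algorithm: the abstract gives no equations, so everything here is an assumption made for illustration, including the augmented state X = [x; r], a linear reference generator r(k+1) = F r(k), a reference of the same dimension as the state, scalar cost weights, and all function names. The sketch only builds the augmented matrices and evaluates a truncated discounted cost under fixed feedback gains; it performs no learning.

```python
# Illustrative sketch only; all names and the problem setup are assumptions.

def block_diag(a, f):
    """Return the block-diagonal matrix [[a, 0], [0, f]] as nested lists."""
    n, m = len(a), len(f)
    top = [row + [0.0] * m for row in a]
    bottom = [[0.0] * n + row for row in f]
    return top + bottom


def matvec(mat, vec):
    """Multiply a nested-list matrix by a vector."""
    return [sum(c * v for c, v in zip(row, vec)) for row in mat]


def augment(a, b_list, f):
    """Build T and the padded input matrices for X = [x; r].

    Assumed dynamics: x(k+1) = A x(k) + sum_i B_i u_i(k), r(k+1) = F r(k).
    The reference is not actuated, so each B_i gets zero rows appended.
    """
    t = block_diag(a, f)
    b1_list = [b + [[0.0] * len(b[0]) for _ in f] for b in b_list]
    return t, b1_list


def discounted_cost(t, b1_list, gains, x_aug, n, q, r_weights, gamma, steps):
    """Truncated discounted tracking cost under fixed gains u_i = -K_i X.

    Stage cost (assumed form): q * |x - r|^2 + sum_j r_weights[j] * |u_j|^2.
    """
    cost = 0.0
    for k in range(steps):
        inputs = [[-v for v in matvec(gain, x_aug)] for gain in gains]
        err = [x_aug[i] - x_aug[n + i] for i in range(n)]
        stage = q * sum(e * e for e in err)
        stage += sum(w * sum(v * v for v in u) for w, u in zip(r_weights, inputs))
        cost += (gamma ** k) * stage
        nxt = matvec(t, x_aug)
        for b1, u in zip(b1_list, inputs):
            nxt = [s + d for s, d in zip(nxt, matvec(b1, u))]
        x_aug = nxt
    return cost


# Two players, scalar state, constant reference r = 1, zero feedback gains.
t, b1_list = augment([[0.5]], [[[1.0]], [[0.5]]], [[1.0]])
zero_gains = [[[0.0, 0.0]], [[0.0, 0.0]]]
cost = discounted_cost(t, b1_list, zero_gains, [0.0, 1.0], 1,
                       1.0, [1.0, 1.0], 0.5, 3)
print(cost)  # 1.75
```

In this toy case the tracking error stays at 1 for every step, so an undiscounted sum would grow without bound as the horizon increases, while the discounted sum with gamma = 0.5 stays below 2. This is one common motivation for discounting when the reference does not decay; whether the article argues it the same way is not stated in the abstract.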

Bibliographic details

Source: Optimal Control Applications and Methods, 2020, Issue 4, 18 pages
Format: PDF
Language: English
Chinese Library Classification: Applied mathematics
Keywords: discrete-time; non-zero-sum games; off-policy; optimal tracking control
Date indexed: 2022-08-20 05:22:46

Similar literature

1. Optimal tracking control for non-zero-sum games of linear discrete-time systems via off-policy reinforcement learning [J]. Optimal Control Applications and Methods. 2020, Issue 4
2. Optimal tracking control for completely unknown nonlinear discrete-time Markov jump systems using data-based reinforcement learning method [J]. Jiang He, Zhang Huaguang, Luo Yanhong. Neurocomputing. 2016, Issue juna19
3. Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics [J]. Bahare Kiumarsi, Frank L. Lewis, Hamidreza Modares. Automatica. 2014, Issue 4
4. Off-Policy Reinforcement Learning for Optimal Preview Tracking Control of Linear Discrete-Time systems with unknown dynamics [C]. Chao-Ran Wang, Huai-Ning Wu. Chinese Automation Congress. 2018
5. Optimal tracking control of uncertain systems: On-policy and off-policy reinforcement learning approaches [D]. Modares, Hamidreza. 2015
6. Design of an Optimal Preview Controller for Linear Discrete-Time Descriptor Noncausal Multirate Systems [O]. Mengjuan Cao, Fucheng Liao
7. Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning [O]. Zhenfei Xiao, Jinna Li, Ping Li. 2020
