Off-policy Reinforcement Learning for Robust Control of Discrete-time Uncertain Linear Systems

Abstract

In this paper, an off-policy reinforcement learning (RL) method is developed for the robust stabilizing controller design of discrete-time uncertain linear systems. The proposed robust control design consists of two steps. First, the robust control problem is transformed into an optimal control problem. Second, the off-policy RL method is used to design the optimal control policy, which guarantees the robust stability of the original uncertain system. The condition for the equivalence between the robust control problem and the optimal control problem is discussed. The off-policy method does not require any knowledge of the system dynamics and efficiently uses the data collected online to successively improve the approximate optimal control policy in each iteration. Finally, a simulation example is carried out to verify the effectiveness of the presented algorithm for the robust control problem of discrete-time linear systems with uncertainty.
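To make the second step more concrete, the following is a minimal sketch of a generic off-policy, model-free policy iteration for a discrete-time linear-quadratic problem, using a least-squares fit of the quadratic Q-function. It is not the authors' algorithm: the abstract does not give the robust-to-optimal transformation or the update equations, so the sketch covers only a nominal system without uncertainty. The system matrices `A` and `B`, the weights `Q` and `R`, the sample count, and all function names are illustrative assumptions. `A` and `B` are used only to simulate data and are never seen by the learner.

```python
import numpy as np

def quad_features(z):
    """Features phi(z) with phi(z) @ theta == z.T @ H @ z for symmetric H."""
    i, j = np.triu_indices(len(z))
    scale = np.where(i == j, 1.0, 2.0)  # off-diagonal terms appear twice
    return scale * z[i] * z[j]

def theta_to_matrix(theta, n):
    """Rebuild the symmetric matrix H from its upper-triangular entries."""
    H = np.zeros((n, n))
    i, j = np.triu_indices(n)
    H[i, j] = theta
    H[j, i] = theta
    return H

def off_policy_lqr(data, Q, R, K0, iters=20):
    """Policy iteration on the Q-function for the control law u = -K x.

    data: list of (x, u, x_next) generated by ANY behavior policy.
    K0:   initial gain, assumed stabilizing (assumption of this sketch).
    """
    n, m = Q.shape[0], R.shape[0]
    K = K0.copy()
    for _ in range(iters):
        Phi, cost = [], []
        for x, u, x_next in data:
            z = np.concatenate([x, u])
            # The next action follows the target policy, not the behavior policy.
            z_next = np.concatenate([x_next, -K @ x_next])
            Phi.append(quad_features(z) - quad_features(z_next))
            cost.append(x @ Q @ x + u @ R @ u)
        # Policy evaluation: solve the Bellman equation in the least-squares sense.
        theta, *_ = np.linalg.lstsq(np.array(Phi), np.array(cost), rcond=None)
        H = theta_to_matrix(theta, n + m)
        # Policy improvement: minimize the quadratic Q-function over u.
        K = np.linalg.solve(H[n:, n:], H[n:, :n])
    return K, H

# Illustrative (hypothetical) open-loop stable system, so K0 = 0 is stabilizing.
rng = np.random.default_rng(0)
A = np.array([[0.9, 0.2], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
Q, R = np.eye(2), np.eye(1)

# Behavior data: random states and random exploratory inputs.
data = []
for _ in range(60):
    x = rng.normal(size=2)
    u = rng.normal(size=1)
    data.append((x, u, A @ x + B @ u))

K, H = off_policy_lqr(data, Q, R, K0=np.zeros((1, 2)))
print("learned gain K:", K)
```

The same batch of data is reused in every iteration, which is what makes the scheme off-policy: the samples come from random exploration, while each evaluation step targets the current gain `K`.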

Bibliographic record

Source
Chinese Control Conference | 2017 | pp. 2253-3001 | 6 pages
Authors
Yongliang Yang; Zhishan Guo; Donald Wunsch; Yixin Yin
Original format: PDF
Chinese Library Classification: TP273-53
Keywords
System uncertainty; Robust control; Optimal control; Off-policy reinforcement learning; Model-free

Similar literature

1. Data-Driven Robust Control of Discrete-Time Uncertain Linear Systems via Off-Policy Reinforcement Learning [J]. IEEE Transactions on Neural Networks and Learning Systems. 2019, No. 12
2. Optimal tracking control for non-zero-sum games of linear discrete-time systems via off-policy reinforcement learning [J]. Optimal Control Applications and Methods. 2020, No. 4
3. Off-Policy Interleaved Q-Learning: Optimal Control for Affine Nonlinear Discrete-Time Systems [J]. Li Jinna, Chai Tianyou, Lewis Frank L. IEEE Transactions on Neural Networks and Learning Systems. 2019, No. 5
4. Off-policy Reinforcement Learning for Robust Control of Discrete-time Uncertain Linear Systems [C]. Yongliang Yang, Zhishan Guo, Donald Wunsch. Chinese Control Conference. 2017
5. Optimal tracking control of uncertain systems: On-policy and off-policy reinforcement learning approaches [D]. Modares, Hamidreza. 2015
6. Robust Adaptive Control for a Class of Uncertain Nonlinear Systems with Time-Varying Delay [O]. Ruliang Wang, Jie Li, Shanshan Zhang. 2013
7. Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning [O]. Zhenfei Xiao, Jinna Li, Ping Li. 2020
