
Off-Policy Reinforcement Learning for Optimal Preview Tracking Control of Linear Discrete-Time Systems with Unknown Dynamics

Abstract

In this paper, an off-policy reinforcement learning (RL) algorithm is presented to solve the optimal preview tracking control problem for discrete-time systems with unknown dynamics. First, an augmented state-space system that includes the available preview information as part of the state vector is constructed, which casts the preview tracking control problem as a standard linear quadratic regulator (LQR) problem. Second, reinforcement learning is used to solve the associated algebraic Riccati equation (ARE) from online measured data, without requiring a priori knowledge of the system matrices. In contrast to existing off-policy RL algorithms, the proposed scheme handles a preview tracking control problem. A numerical simulation example is given to verify the effectiveness of the proposed control scheme.
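As a rough illustration of the idea described in the abstract (not the authors' algorithm), the sketch below learns the LQR gain of a small, hypothetical augmented system purely from measured state and input data, using a Q-learning-style off-policy policy iteration. The system matrices, dimensions, cost weights, exploration scheme and sample counts are illustrative assumptions; in the paper the augmented state collects the plant state together with the previewed reference information, and the learning scheme is derived from the ARE of that augmented LQR problem.

# Minimal sketch: model-free LQR for an assumed augmented preview system.
# The learner never touches the matrices A, B; they are used only to simulate data.
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical augmented preview system (simulator only; stable, so K = 0 is admissible).
A = np.array([[0.8, 0.1, 0.0],
              [0.0, 0.7, 0.2],
              [0.1, 0.0, 0.6]])
B = np.array([[1.0],
              [0.0],
              [0.5]])
nx, nu = B.shape
Qc = np.eye(nx)          # state weight of the quadratic cost
R = np.eye(nu)           # input weight

# Off-policy data collection: exploratory inputs, gathered once and reused below.
N = 400
X, U, Xn = [], [], []
x = rng.standard_normal(nx)
for k in range(N):
    u = rng.standard_normal(nu)                      # behavior policy: pure exploration
    xn = A @ x + B @ u
    X.append(x); U.append(u); Xn.append(xn)
    x = xn if k % 25 else rng.standard_normal(nx)    # occasional reset
X, U, Xn = map(np.array, (X, U, Xn))

# Q-learning-style policy iteration on the fixed dataset.
# Q_K(x,u) = [x;u]' H [x;u];  Bellman:  z_k' H z_k = cost_k + z_{k+1}' H z_{k+1},
# where z_{k+1} uses the TARGET policy u = -K x, while the applied inputs were
# exploratory -- this reuse of off-policy data is what makes the scheme off-policy.
K = np.zeros((nu, nx))
nz = nx + nu
for it in range(20):
    rows, rhs = [], []
    for xk, uk, xk1 in zip(X, U, Xn):
        zk = np.concatenate([xk, uk])
        zk1 = np.concatenate([xk1, -K @ xk1])
        rows.append(np.kron(zk, zk) - np.kron(zk1, zk1))
        rhs.append(xk @ Qc @ xk + uk @ R @ uk)
    vecH, *_ = np.linalg.lstsq(np.array(rows), np.array(rhs), rcond=None)
    H = vecH.reshape(nz, nz)
    H = 0.5 * (H + H.T)                              # keep the symmetric part
    Huu, Hux = H[nx:, nx:], H[nx:, :nx]
    K = np.linalg.solve(Huu, Hux)                    # policy improvement: u = -K x

# Model-based solution of the same ARE (Riccati recursion), for comparison only.
P = np.eye(nx)
for _ in range(500):
    P = Qc + A.T @ P @ A - A.T @ P @ B @ np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
K_star = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)

print("learned K :", K)
print("optimal K*:", K_star)

Note that the dataset is generated once with exploratory inputs and then reused in every policy-evaluation step; the evaluated policy only enters through the next-state term of the Bellman regression, which is the essential off-policy feature the abstract refers to.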