Chinese Automation Congress

Off-Policy Reinforcement Learning for Optimal Preview Tracking Control of Linear Discrete-Time Systems with Unknown Dynamics


Abstract

In this paper, an off-policy reinforcement learning (RL) algorithm is presented to solve the optimal preview tracking control problem for discrete-time systems with unknown dynamics. First, an augmented state-space system that includes the available preview knowledge as part of the state vector is constructed, casting the preview tracking control problem as a standard linear quadratic regulator (LQR) problem. Second, the reinforcement learning technique is used to solve the algebraic Riccati equation (ARE) from online measured data, without requiring a priori knowledge of the system matrices. In contrast to existing off-policy RL algorithms, the proposed scheme solves a preview tracking control problem. A numerical simulation example is given to verify the effectiveness of the proposed control scheme.
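The augmentation idea in the abstract can be sketched concretely. The snippet below folds previewed reference samples into the state so that preview tracking becomes a standard LQR problem, then solves the resulting discrete-time ARE. All numbers (plant parameters a, b, preview length h, weights Q, R) are illustrative assumptions, and for clarity the ARE is solved here by model-based Riccati iteration; the paper's contribution is to solve it model-free from measured data.

```python
import numpy as np

# Hypothetical scalar plant x[k+1] = a x[k] + b u[k] with h previewed
# reference samples; all values are illustrative assumptions.
a, b, h = 0.9, 0.5, 2

n = 1 + (h + 1)                # augmented state z = [x; r[k]; ...; r[k+h]]
A = np.zeros((n, n))
A[0, 0] = a                    # plant dynamics
for i in range(h):
    A[1 + i, 2 + i] = 1.0      # previewed reference shifts forward each step
# last reference row left zero: reference assumed zero beyond the preview horizon
B = np.zeros((n, 1))
B[0, 0] = b

C = np.zeros((1, n))
C[0, 0], C[0, 1] = 1.0, -1.0   # tracking error e[k] = x[k] - r[k]
Q = C.T @ C                    # penalize squared tracking error only
R = np.array([[0.1]])          # control-effort weight

# Fixed-point (Riccati) iteration on the discrete-time ARE:
#   P = Q + A'PA - A'PB (R + B'PB)^{-1} B'PA
P = np.eye(n)
for _ in range(500):
    K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
    P = Q + A.T @ P @ (A - B @ K)

K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
print("preview feedback gains:", K)   # control law u[k] = -K z[k]
```

In the off-policy RL scheme described by the abstract, the Riccati iteration above would be replaced by least-squares estimates built from measured state and input trajectories, so the gain K is obtained without knowing A or B.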
