Annual American Control Conference

Output Feedback Reinforcement Learning Control for the Continuous-Time Linear Quadratic Regulator Problem


Abstract

In this paper, we present an output feedback reinforcement learning scheme to solve the LQR problem for continuous-time linear systems. The problem consists of finding the optimal feedback gain that achieves asymptotic stability without knowledge of the system dynamics or measurements of the full state. An output feedback policy iteration algorithm is proposed that iteratively solves the ADP Bellman equation to find the optimal control parameters. Unlike existing methods, the proposed scheme requires no discrete approximation and is not affected by excitation noise bias. As a result, the need for a discounting factor, which has been a bottleneck in achieving stability guarantees, is eliminated. The learned control parameters are optimal and match exactly the solution of the LQR Riccati equation. Simulation results show the effectiveness of the proposed scheme.
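To illustrate the fixed point the abstract refers to, the following is a minimal, model-based sketch of policy iteration for the continuous-time LQR (Kleinman's algorithm), not the paper's data-driven output feedback scheme: it alternates policy evaluation (a Lyapunov equation) with policy improvement, and its iterates converge to the same gain given by the LQR Riccati equation. The plant matrices and initial stabilizing gain below are illustrative assumptions; the paper's contribution is reaching this fixed point without knowing `A`, `B` or the full state.

```python
# Model-based policy iteration for continuous-time LQR (Kleinman's algorithm).
# Illustrative sketch only: the example plant (A, B, Q, R) and the initial
# stabilizing gain K are assumptions, not taken from the paper.
import numpy as np
from scipy.linalg import solve_continuous_lyapunov, solve_continuous_are

A = np.array([[0.0, 1.0], [0.0, -1.0]])   # example plant dynamics
B = np.array([[0.0], [1.0]])
Q = np.eye(2)                             # state cost
R = np.array([[1.0]])                     # input cost

K = np.array([[1.0, 1.0]])                # initial stabilizing feedback gain
for _ in range(10):
    Ac = A - B @ K                        # closed-loop matrix under current policy
    # Policy evaluation: solve Ac^T P + P Ac + Q + K^T R K = 0
    P = solve_continuous_lyapunov(Ac.T, -(Q + K.T @ R @ K))
    # Policy improvement: K <- R^{-1} B^T P
    K = np.linalg.solve(R, B.T @ P)

# The iterates match the direct solution of the LQR Riccati equation.
P_are = solve_continuous_are(A, B, Q, R)
print(np.allclose(P, P_are))
```

Each iteration requires only a linear (Lyapunov) solve rather than the quadratic Riccati equation, which is what makes the policy iteration structure amenable to being learned from data.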
