Reinforcement learning (RL) is a powerful method for learning policies in environments with delayed feedback. It is typically used to learn a control policy for systems with an unknown model. It would therefore be desirable to apply RL to learning controllers for first-order linear systems (FOLS), which are used to model many processes in Cyber-Physical Systems. However, a challenge in applying RL techniques to FOLS is the mismatch between the continuous-time modeling of the linear-systems framework and the discrete-time perspective of classical RL. In this paper, we show that the optimal continuous-time value function can be approximated as a linear combination over a set of quadratic basis functions, whose coefficients can be learned in a model-free way by methods such as Q-learning. In addition, we show that the performance of the learned controller converges to that of the optimal continuous-time controller as the step size approaches zero.
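To make the idea concrete, the following is a minimal sketch (not the paper's algorithm) of model-free Q-learning with a quadratic basis on a scalar first-order linear system. The system parameters (`a`, `b`), cost weights (`q`, `r`), and step size `h` are illustrative assumptions; the Q-function is represented as a linear combination of the quadratic features x², xu, and u², whose coefficients are fit by least squares from sampled transitions, and the result is checked against the discrete-time Riccati solution.

```python
import numpy as np

# Illustrative scalar FOLS dx/dt = a*x + b*u, Euler-discretized with step h:
#   x' = (1 + a*h)*x + b*h*u,  stage cost h*(q*x^2 + r*u^2).
# All parameter values below are assumptions for the sketch.
a, b, q, r, h = -1.0, 1.0, 1.0, 0.1, 0.01
A, B = 1.0 + a * h, b * h

def step(x, u):
    """One discretized transition and its stage cost."""
    return A * x + B * u, h * (q * x ** 2 + r * u ** 2)

def phi(x, u):
    """Quadratic basis: Q(x, u) is approximated as w . phi(x, u)."""
    return np.array([x * x, x * u, u * u])

rng = np.random.default_rng(0)
K = 0.0                          # linear feedback u = -K*x (initially zero)
for _ in range(20):              # model-free policy iteration (LSTD-Q style)
    M = np.zeros((3, 3))
    v = np.zeros(3)
    for _ in range(2000):        # exploratory samples of states and actions
        x = rng.uniform(-2.0, 2.0)
        u = rng.uniform(-5.0, 5.0)
        x1, c = step(x, u)
        u1 = -K * x1             # action the current policy takes at x'
        f = phi(x, u)
        # Undiscounted Bellman equation f.w = c + phi(x', u').w in
        # least-squares form: M w = v.
        M += np.outer(f, f - phi(x1, u1))
        v += f * c
    w = np.linalg.solve(M, v)
    K = w[1] / (2.0 * w[2])      # greedy policy from the learned quadratic Q

# Model-based reference: fixed-point iteration of the discrete Riccati equation.
P = 1.0
for _ in range(10000):
    P = h * q + A * A * P - (A * P * B) ** 2 / (h * r + B * B * P)
K_star = (A * P * B) / (h * r + B * B * P)
print("learned gain:", K, "Riccati gain:", K_star)
```

Because the true Q-function of any stabilizing linear policy on this system is exactly quadratic, the least-squares fit recovers it, and the learned gain matches the Riccati gain; shrinking `h` moves both toward the continuous-time optimum, in the spirit of the convergence result stated above.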