
Model-based reinforcement learning for infinite-horizon approximate optimal tracking

IEEE Annual Conference on Decision and Control


Abstract

This paper provides an approximate online adaptive solution to the infinite-horizon optimal tracking problem for control-affine, continuous-time nonlinear systems with unknown drift dynamics, where model-based reinforcement learning is used to relax the persistence-of-excitation condition. Model-based reinforcement learning is implemented using a concurrent learning-based system identifier to simulate experience by evaluating the Bellman error over unexplored areas of the state space. Tracking of the desired trajectory and convergence of the developed policy to a neighborhood of the optimal policy are established via a Lyapunov-based stability analysis.
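To make the mechanism concrete: for a control-affine system with an identified drift model, the Bellman error that the simulated experience evaluates takes the generic form below. The notation (critic weights \hat{W}_c, basis \sigma, identified drift \hat{f}_\theta) is illustrative and not taken from the paper; for the tracking problem, x would be the concatenated tracking error and desired-trajectory state, with the cost weighting the error component.

\[
\hat{V}(x) = \hat{W}_c^{\top}\sigma(x), \qquad
\delta(x) = \nabla\hat{V}(x)^{\top}\bigl(\hat{f}_{\theta}(x) + g(x)\,\hat{u}(x)\bigr) + x^{\top}Q\,x + \hat{u}(x)^{\top}R\,\hat{u}(x).
\]

Because \delta depends on the state only through the identified model and the basis functions, it can be evaluated at arbitrary off-trajectory points, which is how the method can substitute recorded and simulated data for persistent excitation along the actual trajectory.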
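The following is a minimal, self-contained Python sketch of the overall scheme under simplifying assumptions: a scalar system, a linearly parameterized drift model, a polynomial value-function basis, and a plain gradient actor update. All names, gains, and basis choices are illustrative; the paper's actual update laws and tracking-error formulation are more involved.

import numpy as np

# Illustrative scalar control-affine system x_dot = f(x) + g(x) u, with the
# drift f unknown to the learner (all choices here are assumptions).
def f_true(x): return -x + 0.5 * np.sin(x)     # unknown drift
def g(x): return 1.0                            # known control effectiveness
def Y(x): return np.array([x, np.sin(x)])       # regressor: f(x) ~= theta @ Y(x)
def dsigma(x): return np.array([2*x, 4*x**3])   # gradient of value basis [x^2, x^4]

Q, R, dt = 1.0, 1.0, 0.005
theta = np.zeros(2)   # concurrent-learning drift estimate
Wc = np.ones(2)       # critic weights: V(x) ~= Wc @ [x^2, x^4]
Wa = np.ones(2)       # actor weights

def policy(x, W):
    # u = -(1/2) R^{-1} g(x) dsigma(x)^T W  (control-affine optimal-policy form)
    return -0.5 / R * g(x) * (dsigma(x) @ W)

def bellman_error(x):
    # delta = dV/dx * (f_hat + g u) + Q x^2 + R u^2, using the identified drift
    u = policy(x, Wa)
    return (dsigma(x) @ Wc) * (theta @ Y(x) + g(x) * u) + Q * x**2 + R * u**2

history = []                                            # recorded (x, x_dot, u)
extrap = np.random.default_rng(0).uniform(-2, 2, 10)    # off-trajectory BE points
x = 1.0
for _ in range(20000):
    u = policy(x, Wa)
    xdot = f_true(x) + g(x) * u                 # true system response

    # Concurrent-learning identifier: gradient step over recorded data, so a
    # richness condition on the history (not persistent excitation) drives theta.
    history = (history + [(x, xdot, u)])[-50:]
    grad = sum((xd - g(xj) * uj - theta @ Y(xj)) * Y(xj) for xj, xd, uj in history)
    theta = theta + dt * 5.0 * grad / len(history)

    # Simulated experience: normalized gradient descent on the Bellman error at
    # the current state and at unexplored extrapolation points.
    for xe in np.append(extrap, x):
        d = bellman_error(xe)
        w = dsigma(xe) * (theta @ Y(xe) + g(xe) * policy(xe, Wa))
        Wc = Wc - dt * 1.0 * d * w / (1.0 + w @ w)

    Wa = Wa - dt * 0.5 * (Wa - Wc)              # actor tracks the critic (simplified)
    x = x + dt * xdot                           # Euler step on the true dynamics

The design point the sketch illustrates is that the Bellman-error updates to Wc run over the extrapolation points on every step, so the critic keeps learning even when the actual trajectory has settled and no longer excites the basis functions.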
