Annual American Control Conference

Cooperative Model-Based Reinforcement Learning for Approximate Optimal Tracking

Abstract

This paper provides an approximate online adaptive solution to the infinite-horizon optimal tracking problem for a set of agents with homogeneous dynamics and common tracking objectives. Model-based reinforcement learning is implemented by simultaneously evaluating the Bellman error (BE) at the state of each agent and, as needed, at nearby off-trajectory points throughout the state space. Each agent calculates and shares its on- and off-trajectory BE information with a centralized estimator, which computes updates to the approximate solution of the infinite-horizon optimal tracking problem and shares the estimate with the agents. In doing so, edge computing is leveraged to share the computational burden of BE extrapolation between the agents and a centralized updating resource. Uniformly ultimately bounded tracking of each agent's state to the desired state and convergence of the control policy to a neighborhood of the optimal policy are proven via a Lyapunov-like stability analysis.
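To make the architecture concrete, the following is a minimal Python sketch of one round of the cooperative loop: each agent evaluates the BE at its own state and at a few extrapolated nearby points, and a centralized estimator takes a step that decreases the squared BE over all reported points. Everything in the sketch is an illustrative assumption rather than the paper's method: the dynamics f and g, the cost weights Q and R, the value-function basis phi, the extrapolation rule, and the plain semi-gradient update all stand in for the paper's tracking-error formulation and actor-critic update laws, and the BE is written in regulation form rather than tracking coordinates.

```python
import numpy as np

# Illustrative problem data (assumptions, not from the paper): known
# dynamics xdot = f(x) + g(x) u, quadratic cost x'Qx + u'Ru, and a
# polynomial basis phi(x) for the value-function approximation V ~ W'phi.
def f(x):
    return np.array([x[1], -x[0] - 0.5 * x[1]])

def g(x):
    return np.array([[0.0], [1.0]])

Q = np.eye(2)
R = np.eye(1)

def phi(x):
    return np.array([x[0] ** 2, x[0] * x[1], x[1] ** 2])

def dphi(x):  # Jacobian of the basis phi
    return np.array([[2 * x[0], 0.0],
                     [x[1], x[0]],
                     [0.0, 2 * x[1]]])

def policy(x, W):
    """Approximate optimal feedback u = -1/2 R^{-1} g(x)' dphi(x)' W."""
    return -0.5 * np.linalg.solve(R, g(x).T @ dphi(x).T @ W)

def bellman_error(x, W):
    """BE of the HJB equation under the current weight estimate W."""
    u = policy(x, W)
    xdot = f(x) + g(x) @ u
    return float(x @ Q @ x + u @ R @ u + W @ dphi(x) @ xdot)

def agent_report(x_agent, W, n_extrap=5, radius=0.5, rng=None):
    """An agent's contribution: BE at its own state plus at nearby
    off-trajectory points (the extrapolated BE data it would send
    to the centralized estimator)."""
    rng = rng if rng is not None else np.random.default_rng()
    points = [x_agent] + [x_agent + radius * rng.standard_normal(2)
                          for _ in range(n_extrap)]
    return [(x, bellman_error(x, W)) for x in points]

def central_update(W, reports, lr=1e-2):
    """Centralized step: decrease the sum of squared BEs over every
    reported point (a plain semi-gradient that ignores the dependence
    of u on W; the paper uses more elaborate update laws)."""
    grad = np.zeros_like(W)
    for x, delta in reports:
        u = policy(x, W)
        grad += delta * (dphi(x) @ (f(x) + g(x) @ u))
    return W - lr * grad

# One round of the cooperative loop for two agents.
rng = np.random.default_rng(0)
W = np.ones(3)
agent_states = [np.array([1.0, 0.0]), np.array([-0.5, 1.0])]
reports = [r for x in agent_states for r in agent_report(x, W, rng=rng)]
W = central_update(W, reports)
print("updated critic weights:", W)
```

The division of labor described in the abstract is visible in the split between agent_report and central_update: the per-point BE evaluations, which are the expensive and parallelizable part, stay with the agents, while only the weight update runs on the shared edge resource.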