Online adaptive optimal control algorithm based on synchronous integral reinforcement learning with explorations

Guo Lei; Zhao Han

首页> 外文期刊>Neurocomputing >Online adaptive optimal control algorithm based on synchronous integral reinforcement learning with explorations

【24h】

Online adaptive optimal control algorithm based on synchronous integral reinforcement learning with explorations

机译：Online adaptive optimal control algorithm based on synchronous integral reinforcement learning with explorations

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相关主题

摘要

In this study, we present a novel algorithm, based on synchronous policy iteration, to solve the continuous-time infinite-horizon optimal control problem of input affine system dynamics. The integral reinforcement is measured as an excitation signal to estimate the solution to the Hamilton-Jacobi-Bell man equation. In addition, the proposed method is completely model-free, that is, no a priori knowledge of the system is required. Using the adaptive tuning law, the actor and critic neural networks can simultaneously approximate the optimal value function and policy. The persistence of excitation condition is required to guarantee the convergence of the two networks. Unlike in traditional policy iteration algorithms, the restriction of the initial admissible policy was eliminated using this method. The effectiveness of the proposed algorithm is verified through numerical simulations. (c) 2022 Elsevier B.V. All rights reserved.

著录项

来源
《Neurocomputing》 |2023年第1期|250-261|共12页
作者
Guo Lei; Zhao Han;
展开▼
作者单位

Beijing Univ Posts & Telecommun;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种英语
中图分类
关键词
Reinforcement learning; Neural networks; Adaptive control; Actor -critic; Explorations;

Online adaptive optimal control algorithm based on synchronous integral reinforcement learning with explorations

摘要

著录项

相关主题

期刊订阅