首页> 外文会议>Chinese Control Conference >Optimal Control of Affine Nonlinear Continuous-time Systems Using Online Actor-Critic Algorithm

【24h】

Optimal Control of Affine Nonlinear Continuous-time Systems Using Online Actor-Critic Algorithm

机译：使用在线演员批评算法的仿射非线性连续时间系统的最佳控制

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper we propose a new online actor-critic algorithm based on policy iteration for learning the continuous-time optimal control solution with infinite horizon cost for nonlinear systems. In other word, the algorithm solves online an algebraic Riccati equation without knowing the internal dynamics model of the system. This approach is implemented as an actor-critic structure which involves both actor and critic neural networks. Because of using a policy iteration method, the present algorithm alternates between the policy evaluation and policy update steps until an update of the control policy will no longer improve the system performance. Simulation results show the effectiveness of the new algorithm.

机译：本文提出了一种基于政策迭代的新的在线演员 - 批评算法，用于学习与非线性系统无限地平线成本的连续时间最优控制解决方案。换句话说，该算法在在线解决了代数Riccati等式，而不知道系统的内部动力学模型。这种方法实施为演员 - 批评结构，涉及演员和批评神经网络。由于使用策略迭代方法，本算法在策略评估和策略更新步骤之间交替，直到控制策略的更新将不再提高系统性能。仿真结果表明了新算法的有效性。

著录项

来源
《Chinese Control Conference》|2013年||共4页
会议地点
作者
CHEN Xue-song; YANG Ming-sheng; LIU Fu-chun;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP273-53;
关键词
Optimal control; Policy iteration; Actorcritics; Neural networks;

机译：最优控制;政策迭代;恋爱学;神经网络;

相似文献

外文文献
中文文献
专利

1. Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem [J] . Kyriakos G. Vamvoudakis, Frank L. Lewis Automatica . 2010,第5期

机译：在线actor-critic算法解决连续时间无限视界最优控制问题
2. Approximate Optimal Control of Affine Nonlinear Continuous-Time Systems Using Event-Sampled Neurodynamic Programming [J] . Neural Networks and Learning Systems, IEEE Transactions on . 2017,第3期

机译：仿射非线性连续时间系统的事件采样神经动力学规划的近似最优控制
3. General value iteration based reinforcement learning for solving optimal :tracking control problem of continuous-time affine nonlinear systems [J] . Xiao Geyang, Zhang Huaguang, Luo Yanhong, Neurocomputing . 2017,第JULa5期

机译：基于通用值迭代的强化学习，用于求解连续时间仿射非线性系统的最优跟踪控制问题
4. Optimal control of affine nonlinear continuous-time systems using online actor-critic algorithm [C] . Chen Xue-song, Yang Ming-sheng, Liu Fu-chun Chinese Control Conference . 2013

机译：在线仿生批评算法的仿射非线性连续时间系统的最优控制
5. Online adaptive optimal control for continuous-time systems. [D] . Vrabie, Draguna. 2009

机译：连续时间系统的在线自适应最优控制。
6. Gradient Methods on Strongly Convex Feasible Sets and Optimal Control of Affine Systems [O] . V. M. Veliov, P. T. Vuong -1

机译：强凸可行集的梯度方法和仿射系统的最优控制
7. Online adaptive optimal control for continuous-time nonlinear systems with completely unknown dynamics [O] . Lv Yongfeng, Na Jing, Yang Qinmin, 2016

机译：动力学完全未知的连续时间非线性系统的在线自适应最优控制

Optimal Control of Affine Nonlinear Continuous-time Systems Using Online Actor-Critic Algorithm

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅