Comprehensive comparison of online ADP algorithms for continuous-time optimal control

Zhu Yuanheng; Zhao Dongbin

首页> 外文期刊>Artificial Intelligence Review: An International Science and Engineering Journal >Comprehensive comparison of online ADP algorithms for continuous-time optimal control

【24h】

Comprehensive comparison of online ADP algorithms for continuous-time optimal control

机译：在线ADP算法进行全面比较连续时间最优控制

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Online learning is an important property of adaptive dynamic programming (ADP). Online observations contain plentiful dynamics information, and ADP algorithms can utilize them to learn the optimal control policy. This paper reviews the research of online ADP algorithms for the optimal control of continuous-time systems. With the intensive study, ADP has been developed towards model free and data efficient. After separately introducing the algorithms, we compare their performance on the same problem. This paper is desired to provide a comprehensive understanding of continuous-time online ADP algorithms.

机译：在线学习是自适应动态编程（ADP）的重要属性。在线观察包含丰富的动态信息，ADP算法可以利用它们来学习最佳控制策略。本文综述了在线ADP算法的研究，以实现连续时间系统的最佳控制。随着密集的研究，ADP已经开发了模型自由和数据效率。单独介绍算法后，我们将其性能与同一问题进行比较。希望本文能够全面了解连续时间在线ADP算法。

著录项

来源
《Artificial Intelligence Review: An International Science and Engineering Journal》 |2018年第4期|共17页
作者
Zhu Yuanheng; Zhao Dongbin;
展开▼
作者单位

Chinese Acad Sci Inst Automat State Key Lab Management &

Control Complex Syst Beijing 100190 Peoples R China;

Chinese Acad Sci Inst Automat State Key Lab Management &

Control Complex Syst Beijing 100190 Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词
Adaptive dynamic programming; Policy iteration; Integral reinforcement learning; Experience replay; Off-policy;

机译：自适应动态规划;政策迭代;整体加强学习;体验重播;休息;

相似文献

外文文献
中文文献
专利

1. Comprehensive comparison of online ADP algorithms for continuous-time optimal control [J] . Zhu Yuanheng, Zhao Dongbin Artificial Intelligence Review: An International Science and Engineering Journal . 2018,第4期

机译：在线ADP算法进行全面比较连续时间最优控制
2. Online adaptive optimal control for continuous-time Markov jump linear systems using a novel policy iteration algorithm [J] . Shuping He, Jun Song, Zhengtao Ding, Control Theory & Applications, IET . 2015,第10期

机译：基于新型策略迭代算法的连续时间马尔可夫跳跃线性系统的在线自适应最优控制
3. Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem [J] . Kyriakos G. Vamvoudakis, Frank L. Lewis Automatica . 2010,第5期

机译：在线actor-critic算法解决连续时间无限视界最优控制问题
4. Online policy iteration based algorithms to solve the continuous-time infinite horizon optimal control problem [C] . Vamvoudakis K., Vrabie D., Lewis F. Adaptive Dynamic Programming and Reinforcement Learning, 2009. ADPRL '09 . 2009

机译：基于在线策略迭代的算法来解决连续时间无限期最优控制问题
5. Online adaptive optimal control for continuous-time systems. [D] . Vrabie, Draguna. 2009

机译：连续时间系统的在线自适应最优控制。
6. Optimal Parameter Exploration for Online Change-Point Detection in Activity Monitoring Using Genetic Algorithms [O] . Naveed Khan, Sally McClean, Shuai Zhang, 2014

机译：基于遗传算法的活动监测在线变化点检测的最优参数探索
7. Online adaptive optimal control for continuous-time nonlinear systems with completely unknown dynamics [O] . Lv Yongfeng, Na Jing, Yang Qinmin, 2016

机译：动力学完全未知的连续时间非线性系统的在线自适应最优控制
8. Comparison of Several Gradient Algorithms for Optimal Control Problems [R] . Miele, A., Tietze, J. L., Levy, A. V. 1972

机译：几种梯度算法在最优控制问题中的比较

Comprehensive comparison of online ADP algorithms for continuous-time optimal control

摘要

著录项

相似文献

相关主题

期刊订阅