International Journal of Systems Science

Nearly data-based optimal control for linear discrete model-free systems with delays via reinforcement learning
Abstract

In this paper, a nearly optimal data-based control scheme is proposed for linear discrete-time model-free systems with delays. The nearly optimal control is obtained using only measured input/output data from the system, via a reinforcement learning technique that combines Q-learning with a value iteration algorithm. First, a state estimator is constructed from the measured input/output data. Second, a quadratic functional is used to approximate the value function at each point in the state space, and the data-based control is designed by the Q-learning method using the obtained state estimator. The paper then shows how to solve for the optimal inner kernel matrix P̄ in the least-squares sense by a value iteration algorithm. Finally, numerical examples are given to illustrate the effectiveness of the approach.
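The combination of Q-learning and value iteration described in the abstract can be illustrated with a simplified sketch. The Python code below is an assumption-laden toy, not the paper's method: it treats a delay-free LQR problem with directly measured states (skipping the paper's input/output state estimator), and the system matrices `A`, `B`, costs `Qc`, `Rc`, and all names are hypothetical. At each iteration the quadratic Q-function kernel `H` (playing the role of the paper's P̄) is fitted to Bellman targets in the least-squares sense from measured data alone.

```python
import numpy as np

np.random.seed(0)

# True system, used ONLY to generate data; the learner never sees A or B.
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
Qc = np.eye(2)   # state cost
Rc = np.eye(1)   # input cost
n, m = 2, 1
p = n + m

def quad_features(z):
    """Basis for z^T H z: products z_i z_j, doubled off the diagonal."""
    return np.array([z[i] * z[j] * (1.0 if i == j else 2.0)
                     for i in range(p) for j in range(i, p)])

def unpack(theta):
    """Rebuild the symmetric kernel H from its least-squares parameters."""
    H = np.zeros((p, p))
    k = 0
    for i in range(p):
        for j in range(i, p):
            H[i, j] = H[j, i] = theta[k]
            k += 1
    return H

# Exploratory input/state data (persistently exciting).
N = 200
xs = np.random.randn(N, n)
us = np.random.randn(N, m)
xn = xs @ A.T + us @ B.T          # measured next states

H = np.zeros((p, p))              # value-iteration start: Q_0 = 0
for _ in range(60):
    Hxx, Hxu = H[:n, :n], H[:n, n:]
    Huu, Hux = H[n:, n:], H[n:, :n]
    # Kernel of min_v Q_k(x', v), i.e. the current value-function matrix.
    if np.linalg.norm(Huu) > 1e-12:
        P = Hxx - Hxu @ np.linalg.solve(Huu, Hux)
    else:
        P = Hxx
    # Bellman targets: stage cost plus greedy value of the next state.
    targets = (np.einsum('ij,ij->i', xs @ Qc, xs)
               + np.einsum('ij,ij->i', us @ Rc, us)
               + np.einsum('ij,ij->i', xn @ P, xn))
    Phi = np.array([quad_features(np.concatenate([x, u]))
                    for x, u in zip(xs, us)])
    theta, *_ = np.linalg.lstsq(Phi, targets, rcond=None)  # least-squares fit
    H = unpack(theta)

# Feedback gain of the learned nearly optimal control u = -K x.
K = np.linalg.solve(H[n:, n:], H[n:, :n])
```

Because the targets are exactly quadratic in `(x, u)`, the least-squares step recovers each Q-iteration kernel exactly here; with noisy data or richer dynamics it only approximates it, which is why the resulting control is "nearly" optimal.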
