Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning

Xiong Yang; Derong Liu; Ding Wang; Qinglai Wei

首页> 外文期刊>Neural Networks: The Official Journal of the International Neural Network Society >Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning

【24h】

Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning

机译：使用强化学习的一类未知非仿射非线性系统的离散时间在线学习控制

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, a reinforcement-learning-based direct adaptive control is developed to deliver a desired tracking performance for a class of discrete-time (DT) nonlinear systems with unknown bounded disturbances. We investigate multi-input-multi-output unknown nonaffine nonlinear DT systems and employ two neural networks (NNs). By using Implicit Function Theorem, an action NN is used to generate the control signal and it is also designed to cancel the nonlinearity of unknown DT systems, for purpose of utilizing feedback linearization methods. On the other hand, a critic NN is applied to estimate the cost function, which satisfies the recursive equations derived from heuristic dynamic programming. The weights of both the action NN and the critic NN are directly updated online instead of offline training. By utilizing Lyapunov's direct method, the closed-loop tracking errors and the NN estimated weights are demonstrated to be uniformly ultimately bounded. Two numerical examples are provided to show the effectiveness of the present approach.

机译：在本文中，基于增强学习的直接自适应控制被开发来为具有未知边界扰动的一类离散时间（DT）非线性系统提供理想的跟踪性能。我们研究了多输入多输出未知的非仿射非线性DT系统，并采用了两个神经网络（NNs）。通过使用隐函数定理，动作NN用于生成控制信号，并且还设计为消除未知DT系统的非线性，以利用反馈线性化方法。另一方面，将注释器NN用于估计成本函数，该函数满足从启发式动态规划派生的递归方程。动作NN和评论者NN的权重都直接在线更新，而不是离线训练。利用李雅普诺夫的直接方法，证明了闭环跟踪误差和神经网络估计权重最终是一致的。提供了两个数值示例，以显示本方法的有效性。

著录项

来源
《Neural Networks: The Official Journal of the International Neural Network Society》 |2014年第null期|共12页
作者
Xiong Yang; Derong Liu; Ding Wang; Qinglai Wei;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类神经病学;
关键词

相似文献

外文文献
中文文献
专利

1. Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning [J] . Xiong Yang, Derong Liu, Ding Wang, Neural Networks: The Official Journal of the International Neural Network Society . 2014,第Null期

机译：使用强化学习的一类未知非仿射非线性系统的离散时间在线学习控制
2. Reinforcement learning-based online adaptive controller design for a class of unknown nonlinear discrete-time systems with time delays [J] . Liang Yuling, Zhang Huaguang, Xiao Geyang, Neural computing & applications . 2018,第6期

机译：基于延迟的一类未知非线性离散时间系统的加固基于学习的在线自适应控制器设计
3. Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks [J] . Yang Q., Vance J. B., Jagannathan S. IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics . 2008,第4期

机译：基于强化学习的线性参数化神经网络控制非仿射非线性离散系统
4. Online Reinforcement Learning Control of Unknown Nonaffine Nonlinear Discrete Time Systems [C] . Qinmin Yang, S. Jagannathan IEEE Conference on Decision and Control . 2007

机译：未知非共和非线性离散时间系统的在线增强学习控制
5. Neural network control of nonstrict feedback and nonaffine nonlinear discrete-time systems with application to engine control. [D] . Vance, Jonathan Blake. 2007

机译：非严格反馈和非仿射非线性离散时间系统的神经网络控制及其在发动机控制中的应用。
6. Learning from Simple Ebooks Online Cases or Classroom Teaching When Acquiring Complex Knowledge. A Randomized Controlled Trial in Respiratory Physiology and Pulmonology [O] . Bjarne Skjødt Worm -1

机译：在学习复杂知识时可以从简单的电子书在线案例或课堂教学中学习。呼吸生理学和肺病学的随机对照试验
7. Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks [O] . Qinmin Yang, Student Member, Jonathan Blake Vance, 2013

机译：基于强化学习的线性参数化神经网络控制非仿射非线性离散系统

Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅