Conference: Chinese Automation Congress

Online optimal consensus control based on PI algorithm for nonlinear multi-agent system with completely unknown dynamics



Abstract

This paper addresses the optimal control of continuous-time nonlinear multi-agent systems with completely unknown dynamics using adaptive dynamic programming (ADP). An NN identifier with filter variables estimates the unknown system dynamics, and a policy iteration (PI) algorithm is used to obtain the optimal policies. In the implementation of the algorithm, an NN based on data sampling and the least-squares method approximates the value function weights, which avoids solving for the value function directly. Moreover, the two NNs work simultaneously; that is, the optimal control policies can be calculated online and the actor NN can be removed. In particular, compared with the existing literature, the state derivative is not needed in the sampling process, which makes the algorithm easier to implement. Finally, a simulation is given to illustrate the effectiveness and the whole process of the algorithm.
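
The policy iteration loop with least-squares value fitting described above can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a single agent with scalar linear dynamics x' = a·x + b·u, uses the known coefficients `a` and `b` as a stand-in for the NN identifier's model, and replaces the critic NN with a single quadratic basis V(x) = w·x². All function names and parameter values are hypothetical. It does show how sampled state data and an integral form of the Bellman equation let the value weight be fitted without measuring the state derivative.

```python
import numpy as np

def rollout(k, x0, a, b, q, r, T, dt=1e-3):
    """Simulate x' = a*x + b*u under u = -k*x for T seconds (forward Euler).

    Returns the final state and the accumulated running cost
    integral of (q*x^2 + r*u^2). The model (a, b) is an assumption that
    stands in for an identified model.
    """
    x, cost = float(x0), 0.0
    for _ in range(int(round(T / dt))):
        u = -k * x
        cost += (q * x**2 + r * u**2) * dt
        x += (a * x + b * u) * dt
    return x, cost

def evaluate_policy(k, a, b, q, r, x_samples, T=0.1):
    """Fit w in V(x) = w*x^2 by least squares on sampled trajectories.

    Uses the integral Bellman equation
        w * (x(t)^2 - x(t+T)^2) = accumulated cost over [t, t+T],
    which needs only state samples, not state derivatives.
    """
    phi_diff, costs = [], []
    for x0 in x_samples:
        xT, c = rollout(k, x0, a, b, q, r, T)
        phi_diff.append([x0**2 - xT**2])
        costs.append(c)
    w, *_ = np.linalg.lstsq(np.array(phi_diff), np.array(costs), rcond=None)
    return float(w[0])

def policy_iteration(a, b, q, r, k0=0.0, iters=8):
    """Alternate policy evaluation and improvement from a stabilizing gain k0."""
    x_samples = np.array([-2.0, -1.0, 0.5, 1.5])  # hypothetical sample states
    k, w = k0, 0.0
    for _ in range(iters):
        w = evaluate_policy(k, a, b, q, r, x_samples)
        # Improvement: u = -(1/(2r)) * b * dV/dx = -(b*w/r) * x
        k = b * w / r
    return w, k

if __name__ == "__main__":
    w, k = policy_iteration(a=-1.0, b=1.0, q=1.0, r=1.0)
    print(f"value weight w = {w:.4f}, feedback gain k = {k:.4f}")
```

For this scalar case the result can be checked against the positive root of the Riccati equation 2·a·p − b²·p²/r + q = 0; the fitted weight should approach that root up to the discretization error of the Euler rollout. The control law is computed directly from the value weight, which mirrors the abstract's remark that a separate actor NN is not required.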
