Automatica

Off-policy learning for adaptive optimal output synchronization of heterogeneous multi-agent systems

Abstract

This paper proposes an off-policy learning-based dynamic state feedback protocol that achieves optimal output synchronization of heterogeneous multi-agent systems (MAS) over a directed communication network. Note that most recent works on heterogeneous MAS do not design the synchronization protocol in an optimal manner. By formulating the cooperative output regulation problem as an H-infinity optimization problem, reinforcement learning can be used to find output synchronization protocols online along the system trajectories, without solving the output regulator equations. In contrast to the existing literature on optimal synchronization, where the leader's states are assumed to be globally or distributively available over the communication network, we only allow relative system outputs to be transmitted through the network; that is, the leader's states are not needed for either control or learning. (C) 2020 Elsevier Ltd. All rights reserved.
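
For context, a cooperative output regulation problem of this kind is typically recast as a disturbance-attenuation problem before reinforcement learning is applied. The following is only a generic sketch of such an H-infinity formulation, with notation assumed here rather than taken from the paper: e_i denotes agent i's output tracking error relative to the leader, u_i its control input, w the exosystem signal treated as an external disturbance, gamma > 0 the attenuation level, and Q, R are weighting matrices.

    \min_{u_i} \; \max_{w} \; \int_{0}^{\infty} \left( e_i^{\top} Q\, e_i + u_i^{\top} R\, u_i - \gamma^{2} w^{\top} w \right) dt, \qquad Q \succeq 0, \; R \succ 0.

The off-policy element would then amount to evaluating and improving this min-max value function from trajectories generated under an arbitrary behavior policy, which is consistent with the abstract's claim that the protocols are learned online without explicitly solving the output regulator equations.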
