IEEE Transactions on Cybernetics

Resilient Autonomous Control of Distributed Multiagent Systems in Contested Environments


Abstract

An autonomous and resilient controller is proposed for leader-follower multiagent systems under uncertainties and cyber-physical attacks. The leader is assumed to be nonautonomous, with a nonzero control input, which allows the team behavior or mission to change in response to environmental changes. A resilient learning-based control protocol is presented to find optimal solutions to the synchronization problem in the presence of attacks and system dynamic uncertainties. An observer-based distributed H-infinity controller is first designed to prevent the effects of attacks on sensors and actuators from propagating throughout the network, as well as to attenuate the effect of these attacks on the compromised agent itself. Nonhomogeneous game algebraic Riccati equations are derived to solve the H-infinity optimal synchronization problem, and off-policy reinforcement learning (RL) is utilized to learn their solution without requiring any knowledge of the agents' dynamics. A trust-confidence-based distributed control protocol is then proposed to mitigate attacks that hijack an entire node, as well as attacks on communication links. A confidence value is defined for each agent based solely on its local evidence. The proposed resilient RL algorithm employs the confidence value of each agent to indicate the trustworthiness of its own information, and each agent broadcasts this value to its neighbors so that they can weight the data they receive from it during and after learning. If an agent's confidence value is low, it employs a trust mechanism to identify compromised agents and removes the data received from them from the learning process. Simulation results are provided to show the effectiveness of the proposed approach.
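
For context, the setup the abstract refers to can be written out in a standard form. The following is a minimal sketch assuming linear follower dynamics; the symbols (graph weights a_ij, pinning gains g_i, weighting matrices Q and R, attenuation level gamma) follow common usage in the distributed H-infinity synchronization literature and are not taken from the paper itself. The nonzero leader input u_0 is what makes the associated game algebraic Riccati equations nonhomogeneous.

```latex
% Follower i (linear, with disturbance/attack input w_i) and a
% nonautonomous leader with nonzero control input u_0:
\dot{x}_i = A x_i + B u_i + D w_i, \qquad
\dot{x}_0 = A x_0 + B u_0 .
% Local neighborhood synchronization error
% (a_{ij}: graph weights, g_i: pinning gain to the leader):
\varepsilon_i = \sum_{j \in \mathcal{N}_i} a_{ij}\,(x_i - x_j)
              + g_i\,(x_i - x_0) .
% H-infinity synchronization: choose u_i so that, for all w_i \ne 0,
\frac{\int_0^\infty \bigl(\varepsilon_i^\top Q\,\varepsilon_i
      + u_i^\top R\,u_i\bigr)\,dt}
     {\int_0^\infty w_i^\top w_i \, dt} \;\le\; \gamma^2 .
```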
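The confidence/trust mechanism described at the end of the abstract can be pictured with a short sketch. This is a hypothetical illustration under assumptions, not the paper's algorithm: the residual-based confidence map, the function names, and the thresholds are all invented for the example.

```python
import numpy as np

def confidence(residual_norm, kappa=5.0):
    """Hypothetical confidence value in (0, 1]: close to 1 when the local
    observer residual is small, close to 0 when it suggests an attack."""
    return float(np.exp(-kappa * residual_norm))

def trust_weighted_update(x_i, neighbors, own_conf,
                          trust_threshold=0.3, step=0.1):
    """One consensus-style update where each neighbor's broadcast state is
    weighted by the confidence value that neighbor attached to it.

    neighbors: list of (x_j, conf_j) pairs received over communication links.
    When the agent's own confidence is low, it additionally invokes the trust
    mechanism: neighbors whose broadcast confidence falls below
    trust_threshold are treated as compromised and their data is dropped.
    """
    total = np.zeros_like(x_i)
    weight = 0.0
    for x_j, conf_j in neighbors:
        if own_conf < 0.5 and conf_j < trust_threshold:
            continue  # identified as compromised: exclude from learning data
        total += conf_j * (x_j - x_i)
        weight += conf_j
    if weight == 0.0:
        return x_i  # no trusted neighbors: rely on local information only
    return x_i + step * total / weight

# Example: an agent with low own confidence drops the low-confidence
# neighbor and averages over the two trusted ones.
x_i = np.array([0.0, 0.0])
neighbors = [(np.array([1.0, 1.0]), 0.9),
             (np.array([2.0, 0.0]), 0.8),
             (np.array([9.0, 9.0]), 0.1)]   # likely compromised
print(trust_weighted_update(x_i, neighbors, own_conf=confidence(0.4)))
```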
