Mean Field LQG Control in Leader-Follower Stochastic Multi-Agent Systems: Likelihood Ratio Based Adaptation

Nourian M.; Caines P. E.; Malhame R. P.; Huang M.


Abstract

We study large population leader-follower stochastic multi-agent systems where the agents have linear stochastic dynamics and are coupled via their quadratic cost functions. The cost of each leader is based on a trade-off between moving toward a certain reference trajectory which is unknown to the followers and staying near their own centroid. On the other hand, followers react by tracking a convex combination of their own centroid and the centroid of the leaders. We approach this large population dynamic game problem by use of so-called Mean Field (MF) linear-quadratic-Gaussian (LQG) stochastic control theory. In this model, followers are adaptive in the sense that they use a likelihood ratio estimator (on a sample population of the leaders' trajectories) to identify the member of a given finite class of models which is generating the reference trajectory of the leaders. Under appropriate conditions, it is shown that the true reference trajectory model is identified by each follower in finite time with probability one as the leaders' population goes to infinity. Furthermore, we show that the resulting sets of mean field control laws for both leaders and adaptive followers possess an almost sure $\varepsilon_{N}$-Nash equilibrium property for a system with population $N$ where $\varepsilon_{N}$ goes to zero as $N$ goes to infinity. Numerical experiments are presented illustrating the results.
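The identification step described in the abstract can be illustrated with a toy sketch. This is not the paper's estimator: it assumes a scalar, discrete-time setting in which each observed leader state equals the true reference trajectory plus independent Gaussian noise of known standard deviation `sigma`, and it selects the candidate model with the largest Gaussian log-likelihood. The function name `identify_reference_model`, the three candidate trajectories, and all numerical values are hypothetical.

```python
import numpy as np


def identify_reference_model(observations, candidates, times, sigma):
    """Pick the candidate trajectory with the highest Gaussian log-likelihood.

    observations: array of shape (n_leaders, n_steps), sampled leader states
    candidates:   list of callables h(t), the finite class of reference models
    times:        array of shape (n_steps,)
    sigma:        assumed known noise standard deviation
    Returns the index of the best candidate and the log-likelihood ratios
    of every candidate relative to the best one (all <= 0).
    """
    log_liks = []
    for h in candidates:
        # Residuals broadcast over the leader axis.
        residuals = observations - h(times)
        # Gaussian log-likelihood up to a constant shared by all candidates.
        log_liks.append(-0.5 * np.sum(residuals ** 2) / sigma ** 2)
    log_liks = np.array(log_liks)
    return int(np.argmax(log_liks)), log_liks - log_liks.max()


# Hypothetical setup: three candidate reference trajectories, one of them true.
rng = np.random.default_rng(0)
times = np.linspace(0.0, 5.0, 50)
candidates = [
    lambda t: np.sin(t),
    lambda t: np.cos(t),
    lambda t: 0.5 * t,
]
true_index = 1
sigma = 1.0
n_leaders = 200

# Each sampled leader state is the true reference plus independent noise (assumption).
noise = sigma * rng.standard_normal((n_leaders, times.size))
observations = candidates[true_index](times) + noise

estimate, log_ratios = identify_reference_model(observations, candidates, times, sigma)
print("identified model index:", estimate)
```

In this simplified setting, the log-likelihood gap between the true model and each wrong candidate grows with the number of sampled leaders, which is the intuition behind identification becoming reliable as the leaders' population grows.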


Bibliographic record

Source
Automatic Control, IEEE Transactions on | 2012, No. 11 | pp. 2801-2816 | 16 pages
Authors
Nourian M.; Caines P. E.; Malhame R. P.; Huang M.
Author affiliation

Centre for Intelligent Machines (CIM) and the Department of Electrical & Computer Engineering, McGill University, Montreal, Canada

Indexing information
Original format: PDF
Language of text: English
Keywords
Adaptive control; Nash equilibria; leader-follower collective behavior; likelihood ratio based adaptation; mean field (MF) stochastic control; stochastic optimal control;

