首页> 外文期刊>Automatic Control, IEEE Transactions on >Mean Field LQG Control in Leader-Follower Stochastic Multi-Agent Systems: Likelihood Ratio Based Adaptation
【24h】

Mean Field LQG Control in Leader-Follower Stochastic Multi-Agent Systems: Likelihood Ratio Based Adaptation

机译:跟随者随机多智能体系统中的平均场LQG控制:基于似然比的自适应

获取原文
获取原文并翻译 | 示例

摘要

We study large population leader-follower stochastic multi-agent systems where the agents have linear stochastic dynamics and are coupled via their quadratic cost functions. The cost of each leader is based on a trade-off between moving toward a certain reference trajectory which is unknown to the followers and staying near their own centroid. On the other hand, followers react by tracking a convex combination of their own centroid and the centroid of the leaders. We approach this large population dynamic game problem by use of so-called Mean Field (MF) linear-quadratic-Gaussian (LQG) stochastic control theory. In this model, followers are adaptive in the sense that they use a likelihood ratio estimator (on a sample population of the leaders' trajectories) to identify the member of a given finite class of models which is generating the reference trajectory of the leaders. Under appropriate conditions, it is shown that the true reference trajectory model is identified by each follower in finite time with probability one as the leaders' population goes to infinity. Furthermore, we show that the resulting sets of mean field control laws for both leaders and adaptive followers possess an almost sure $varepsilon_{N}$-Nash equilibrium property for a system with population $N$ where $varepsilon_{N}$ goes to zero as $N$ goes to infinity. Numerical experiments are presented illustrating the results.
机译:我们研究了人口众多的跟随者跟随随机多智能体系统,这些智能体具有线性随机动力学并通过其二次成本函数进行耦合。每个领导者的成本是基于朝着跟随者未知的某个参考轨迹与保持靠近自己的质心之间的权衡。另一方面,跟随者通过跟踪自己的质心和领导者质心的凸组合来做出反应。我们通过使用所谓的平均场(MF)线性二次高斯(LQG)随机控制理论来解决这一大型种群动态博弈问题。在此模型中,跟随者是自适应的,即跟随者使用似然比估计值(基于领导者轨迹的样本)来识别给定有限类模型的成员,该模型正在生成领导者的参考轨迹。结果表明,在适当的条件下,随着领导者人数的增加,每个跟随者在有限的时间内都可以识别出真实的参考轨迹模型,概率为1。此外,我们表明,对于具有种群$ N $的系统(其中$ varepsilon_ {N} $进入)的系统,对于领导者和适应性跟随者而言,所得的平均场控制定律集几乎具有肯定的$ varepsilon_ {N} $-Nash均衡性质。 $ N $变为无穷大时为零。数值实验表明了结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号