首页> 外文会议> >Improving coordination with communication in multi-agent reinforcement learning
【24h】

Improving coordination with communication in multi-agent reinforcement learning

机译:在多主体强化学习中改善与沟通的协调

获取原文

摘要

We present a new algorithm for cooperative reinforcement learning in multiagent systems. We consider autonomous and independently learning agents, and we seek to obtain an optimal solution for the team as a whole while keeping the learning as much decentralized as possible. Coordination between agents occurs through communication, namely the mutual notification algorithm. We define the learning problem as a decentralized process using the MDP formalism. We then give an optimality criterion and prove the convergence of the algorithm for deterministic environments. We introduce variable and hierarchical communication strategies which considerably reduce the number of communications. Finally we study the convergence properties and communication overhead on a small example.
机译:我们在多读系统中提出了一种新的合作加固学习算法。我们考虑自动和独立的学习代理,我们寻求整个团队获得最佳解决方案,同时保持学习尽可能多地分散。代理之间的协调通过通信发生,即互通知算法。我们将学习问题定义为使用MDP形式主义的分散过程。然后,我们提供了最佳标准,并证明了确定性环境算法的融合。我们引入可变和分层通信策略,这些策略大大减少了通信数量。最后,我们研究了一个小例子上的收敛性和通信开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号