Conference: RoboCup International Symposium

Solving Large-Scale and Sparse-Reward DEC-POMDPs with Correlation-MDPs



Abstract

Within a group of cooperating agents, the decision making of an individual agent depends on the actions of the other agents. Much effort has gone into solving this problem under additional assumptions about the agents' communication abilities. However, in some real-world applications communication is limited and these assumptions are rarely satisfied. A recently developed alternative is to employ a correlation device to correlate the agents' behavior without exchanging information during execution. In this paper, we apply a correlation device to large-scale and sparse-reward domains. As a basis we use the framework of infinite-horizon DEC-POMDPs, which represents policies as joint stochastic finite-state controllers. To solve a problem of this kind, a correlation device is first computed by solving Correlation Markov Decision Processes (Correlation-MDPs) and then used to improve the local controller of each agent. This method lets us trade off computational complexity against the quality of the approximation. In addition, we demonstrate that adversarial problems can be solved by encoding information about the opponents' behavior in the correlation device. We have implemented the proposed method in our 2D simulated robot soccer team, and its performance in RoboCup-2006 was encouraging.
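
The idea of correlating behavior without communication can be illustrated with a toy sketch. It is not the paper's Correlation-MDP algorithm: the two-state device, the single-node controllers, the action names, and all probabilities below are hypothetical, and observations and controller-node transitions are left out for brevity. It only assumes that the device evolves independently of the environment and that every agent can read its current state.

```python
import random

# Hypothetical toy setup: two agents, two actions, a two-state correlation device.
ACTIONS = ["left", "right"]

# Correlation device: P(c' | c). It evolves independently of the environment,
# and both agents read its current state, so no messages are exchanged.
DEVICE_TRANSITIONS = {0: [0.5, 0.5], 1: [0.5, 0.5]}

# Local action rule shared by both agents: P(a_i | q_i, c).
# Each controller has a single node (q_i = 0) and simply follows the signal.
ACTION_PROBS = {
    (0, 0): [1.0, 0.0],  # node 0, signal 0 -> always "left"
    (0, 1): [0.0, 1.0],  # node 0, signal 1 -> always "right"
}

def step_device(c, rng):
    """Sample the next device state from P(c' | c)."""
    return rng.choices([0, 1], weights=DEVICE_TRANSITIONS[c])[0]

def choose_action(node, c, rng):
    """Sample an action from P(a | node, c) for one agent."""
    return rng.choices(ACTIONS, weights=ACTION_PROBS[(node, c)])[0]

def run(steps, seed=0):
    """Simulate both agents; return a list of (signal, action_1, action_2)."""
    rng = random.Random(seed)
    c = 0
    history = []
    for _ in range(steps):
        c = step_device(c, rng)
        a1 = choose_action(0, c, rng)  # agent 1 sees only its node and c
        a2 = choose_action(0, c, rng)  # agent 2 sees only its node and c
        history.append((c, a1, a2))
    return history
```

In this sketch the joint action is random from step to step, yet the two agents always agree, because both condition on the same device state rather than on messages from each other.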
