首页> 外文会议>IEEE International Conference on System, Man, and Cybernetics >An MDP-based policy for stochastic multi-agent domains
【24h】

An MDP-based policy for stochastic multi-agent domains

机译:基于MDP的随机多算子域的策略

获取原文

摘要

Stochastic environments pose challenging problems for agents trying to act optimally in the presence of other agents. In such environments agents have to contend with the probabilistic effects of other agents' actions, their inability tocompletely observe the state of the world before selecting the next action and in some cases the high cost of communication. In this paper, we show how such systems can be modeled as multi-agent Markov decison processes. We describe a policy thatprescribes an action that has a high probability of being the optimal action under a given global state distribution and present an algorithm that agents can use to act in such environments while attempting to achieve their goals.
机译:随机环境对试图在其他代理商的存在下最佳行动的代理构成具有挑战性的问题。在这种环境中,代理人必须抗争解其他代理人的行动的概率影响,在选择下一个行动之前,他们无法遵守世界的国家,并且在某些情况下沟通的高成本。在本文中,我们展示了这种系统如何建模为多代理马尔可夫甲板流程。我们描述了一项规范,该政策专项是在给定的全局状态分布下的最佳行动的概率高概率的策略,并且在尝试实现目标时,代理可以使用代理商可以用于在这种环境中起作用的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号