首页> 外文会议>2014 Second World Conference on Complex Systems >Policy computation for constrained communicating agents
【24h】

Policy computation for constrained communicating agents

机译:受限通信代理的策略计算

获取原文
获取原文并翻译 | 示例

摘要

Decentralized Markov Decision Processes (DECMDPs) provide powerful modeling tools for cooperative multiagent decision making under uncertainty. However, as basic models, they fail in modeling problems where decision makers must act under time pressure and regarding complex constraints. In this paper, we focus on adapting DEC-MDP model in order to take into account temporal constraints, precedence constraints and uncertain action durations. Particularly, we extend a solution method called opportunity cost DEC-MDP to handle more complex precedence constraints. Because problems we consider require a tight coordination, we introduce communication among agents. We aim at optimizing communication decisions since dealing with offline planning for communication is intractable. To this end, we propose to exploit problem structure in order to limit information sharing. Experimental results show that even if communication is costly, it improves the degree of coordination between agents and it increases team performances regarding constraints.
机译:分散马尔可夫决策过程(DECMDP)为不确定性下的协作多主体决策提供了强大的建模工具。但是,作为基本模型,它们无法对决策者必须在时间压力和复杂约束下采取行动的问题进行建模。在本文中,我们专注于适应DEC-MDP模型,以考虑时间约束,优先约束和不确定的动作持续时间。特别是,我们扩展了一种称为机会成本DEC-MDP的解决方案方法,以处理更复杂的优先级约束。因为我们考虑的问题需要紧密协调,所以我们引入了座席之间的沟通。我们的目标是优化沟通决策,因为处理离线沟通计划非常棘手。为此,我们建议利用问题结构来限制信息共享。实验结果表明,即使沟通成本很高,它也可以提高座席之间的协调程度,并提高团队在约束方面的绩效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号