Policy computation for constrained communicating agents

机译：受限通信代理的策略计算

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Decentralized Markov Decision Processes (DECMDPs) provide powerful modeling tools for cooperative multiagent decision making under uncertainty. However, as basic models, they fail in modeling problems where decision makers must act under time pressure and regarding complex constraints. In this paper, we focus on adapting DEC-MDP model in order to take into account temporal constraints, precedence constraints and uncertain action durations. Particularly, we extend a solution method called opportunity cost DEC-MDP to handle more complex precedence constraints. Because problems we consider require a tight coordination, we introduce communication among agents. We aim at optimizing communication decisions since dealing with offline planning for communication is intractable. To this end, we propose to exploit problem structure in order to limit information sharing. Experimental results show that even if communication is costly, it improves the degree of coordination between agents and it increases team performances regarding constraints.

机译：分散马尔可夫决策过程（DECMDP）为不确定性下的协作多主体决策提供了强大的建模工具。但是，作为基本模型，它们无法对决策者必须在时间压力和复杂约束下采取行动的问题进行建模。在本文中，我们专注于适应DEC-MDP模型，以考虑时间约束，优先约束和不确定的动作持续时间。特别是，我们扩展了一种称为机会成本DEC-MDP的解决方案方法，以处理更复杂的优先级约束。因为我们考虑的问题需要紧密协调，所以我们引入了座席之间的沟通。我们的目标是优化沟通决策，因为处理离线沟通计划非常棘手。为此，我们建议利用问题结构来限制信息共享。实验结果表明，即使沟通成本很高，它也可以提高座席之间的协调程度，并提高团队在约束方面的绩效。

著录项

来源
《2014 Second World Conference on Complex Systems》|2014年|548-553|共6页
会议地点 Agadir(MA)
作者
Abdelmoumene Hiba; Belleili Habiba;
展开▼
作者单位

Comput. Sci. Dept., Badji Mokhtar Univ., Annaba, Algeria;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Communication; Decentralized Markov Decision Process; Execution constraints; Planning under uncertainty;

机译：沟通;分散的马尔可夫决策过程;执行约束;不确定性下的计划;
入库时间 2022-08-26 14:23:40

相似文献

外文文献
中文文献
专利

1. A distributed multiple dimensional QoS constrained resource scheduling optimization policy in computational grid [J] . Li CL, Li LY Journal of computer and system sciences . 2006,第4期

机译：计算网格中的分布式多维QoS约束资源调度优化策略
2. Multi-agent reinforcement learning based maintenance policy for a resource constrained flow line system [J] . Wang Xiao, Wang Hongwei, Qi Chao Journal of Intelligent Manufacturing . 2016,第2期

机译：资源受限流水线系统的基于多主体强化学习的维护策略
3. A computational model of the allocentric and egocentric spatial memory by means of virtual agents, or how simple virtual agents can help to build complex computational models [J] . Cyril Brom, Jan Vyhnanek, Jiri Lukavsky, Cognitive Systems Research . 2012,第期

机译：借助虚拟代理的同心和自我中心空间记忆的计算模型，或者简单的虚拟代理如何帮助构建复杂的计算模型
4. Policy computation for constrained communicating agents [C] . Abdelmoumene Hiba, Belleili Habiba World Conference on Complex Systems . 2014

机译：约束通信代理的策略计算
5. Communicating a frame for service-learning: Engaging students as learners, citizens, and/or change agents. [D] . Britt, Lori L. 2010

机译：交流服务学习框架：让学生成为学习者，公民和/或变革推动者。
6. Architecture to Embed Software Agents in Resource Constrained Internet of Things Devices [O] . Daniel H. De La Iglesia, Gabriel Villarrubia González, André Sales Mendes, 2019

机译：在资源受限的物联网设备中嵌入软件代理的体系结构
7. A Computational Semantics for Communicating Rational Agents Based on Mental Models [O] . Koen V. Hindriks, M. Birna Van Riemsdijk 2012

机译：基于心智模型的Rational Agent通信的计算语义
8. An Agent-Based Model for Analyzing Control Policies and the Dynamic Service-Time Performance of a Capacity-Constrained Air Traffic Management Facility [R] . Conway, Sheila R. 2006

机译：基于agent的容量约束空中交通管理设施控制策略和动态服务时间性能分析模型

Policy computation for constrained communicating agents

摘要

著录项

相似文献

相关主题

期刊订阅