
Scalable, MDP-based planning for multiple, cooperating agents



Abstract

This paper introduces an approximation algorithm for stochastic multi-agent planning based on Markov decision processes (MDPs). Specifically, we focus on a decentralized approach for planning the actions of a team of cooperating agents with uncertainties in fuel consumption and health-related models. The core idea behind the algorithm presented in this paper is to allow each agent to approximate the representation of its teammates. Each agent therefore maintains its own planner that fully enumerates its local states and actions while approximating those of its teammates. In prior work, the authors approximated each teammate individually, which greatly reduced the planning space but left the computational cost exponential (in n − 1 rather than n, where n is the number of agents). This paper extends that approach and presents a new approximation that aggregates all teammates into a single, abstracted entity. Under a persistent search and track mission scenario with 3 agents, we show that while the resulting performance is nearly 20% below that of the centralized optimal solution, the problem size becomes linear in n, a very attractive feature when planning online for large multi-agent teams.
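The scaling claim in the abstract can be made concrete with a small back-of-the-envelope sketch. The model sizes S, a, and A below are hypothetical placeholders, not values from the paper; the functions simply compare how planner state counts grow under a centralized joint MDP, the prior per-teammate approximation, and the aggregated-teammate approximation described here.

```python
# Hypothetical illustration of the state-space growth argument.
#   S - size of one agent's fully enumerated local state space
#   a - size of a reduced per-teammate model (prior work)
#   A - size of the single aggregated teammate abstraction (this paper)
# None of these values come from the paper; they are placeholders.

def centralized_states(n: int, S: int) -> int:
    """Centralized joint MDP over all n agents: exponential in n."""
    return S ** n

def per_teammate_states(n: int, S: int, a: int) -> int:
    """Prior work: own states exact, each of the n - 1 teammates
    approximated individually -> still exponential in n - 1."""
    return S * a ** (n - 1)

def aggregated_states(n: int, S: int, A: int) -> int:
    """This paper's idea: all teammates collapsed into one abstracted
    entity, so each local planner has a fixed size S * A; running one
    planner per agent makes the team's total effort linear in n."""
    return n * S * A

if __name__ == "__main__":
    S, a, A = 100, 10, 25  # hypothetical model sizes
    print(f"{'n':>3} {'centralized':>15} {'per-teammate':>15} {'aggregated':>12}")
    for n in (3, 5, 10):
        print(f"{n:>3} {centralized_states(n, S):>15} "
              f"{per_teammate_states(n, S, a):>15} "
              f"{aggregated_states(n, S, A):>12}")
```

Running this shows the centralized and per-teammate counts exploding with n while the aggregated formulation grows only linearly, which is the trade against the roughly 20% performance loss reported for the 3-agent scenario.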
