首页> 外文学位 >Execution-time communication decisions for coordination of multi-agent teams.

【24h】

Execution-time communication decisions for coordination of multi-agent teams.

机译：执行时间沟通决策，以协调多主体团队。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Although multi-agent teams provide additional functionality and robustness over single-agent systems, they also present additional challenges, mainly due to the difficulty of coordinating multiple agents in the presence of uncertainty and partial observability. Agents must reason about the collective state and behaviors of the team as well as uncertainty in their own environment.;In this thesis, we employ Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs), an extension of single-agent POMDPs that can be used to model and coordinate teams of agents. Although the problem of finding optimal policies for Dec-POMDPs is highly intractable, it is known that the presence of free communication transforms a multi-agent Dec-POMDP into a more tractable single-agent POMDP. We use this transformation to generate "centralized" policies for multi-agent teams modeled by Dec-POMDPs. We facilitate the decentralize execution of these centralized policies by providing algorithms that allow agents to reason about communication at execution-time. Our approach trades off the need to do some computation at execution-time for the ability to generate policies more tractably at plan-lime.;This thesis explores the question of how communication can be used effectively to enable the coordination of cooperative multi-agent teams making sequential decisions under uncertainty and partial observability. We identify two fundamental questions that must be answered when reasoning about communication: "When should agents communicate," and "What should agents communicate?" We present two basic approaches to enabling a team of distributed agents to avoid coordination errors, The first is an algorithm that reasons over the possible joint beliefs the team. We provide algorithms that address the questions of when and what agents should communicate.;The second approach presented in this thesis avoids coordination errors by creating individual factored policy for each agent. Factored policies provide a means for determining which state features agents should communicate, answering the questions of when and what agents should communicate. We use factored policies to identify instances of context-specific independence, in which agents can act without needing to consider the actions or observations of their teammates.

机译：尽管与单代理系统相比，多代理团队提供了更多的功能和鲁棒性，但它们也带来了其他挑战，这主要是由于在存在不确定性和部分可观察性的情况下很难协调多个代理。代理商必须对团队的集体状态和行为以及自身环境中的不确定性进行推理；在本论文中，我们采用分散的部分可观察的马尔可夫决策过程（Dec-POMDP），这是对单代理商POMDP的扩展。用于建模和协调代理商团队。尽管找到针对Dec-POMDP的最佳策略的问题非常棘手，但是众所周知，自由通信的存在将多代理Dec-POMDP转换为更易于处理的单代理POMDP。我们使用此转换为由Dec-POMDP建模的多代理团队生成“集中式”策略。通过提供允许代理在执行时推理通信的算法，我们促进了这些集中策略的分散执行。我们的方法权衡了在执行时需要进行一些计算的需求，以便能够在计划期限内更灵活地生成策略。;本文探讨了如何有效利用沟通来实现协作式多代理团队协调的问题在不确定性和部分可观察性下做出顺序决策。我们确定了在进行交流推理时必须回答的两个基本问题：“代理商应该何时交流”和“代理商应该交流什么？”。我们提出了两种基本方法来使分布式代理团队能够避免协调错误，第一种是一种算法，该算法会推理出团队可能的共同信念。我们提供了解决何时何人与代理人进行交流的问题的算法。本文提出的第二种方法是通过为每个代理人创建个性化策略来避免协调错误。析因策略提供了一种方法，可用来确定代理程序应传达哪些状态功能，并回答何时以及什么代理程序应传达的问题。我们使用分解策略来确定特定于上下文的独立性实例，在这种情况下，座席可以采取行动而无需考虑队友的行动或观察。

著录项

作者
Roth, Maayan.;
展开▼
作者单位

Carnegie Mellon University.;

展开▼
授予单位 Carnegie Mellon University.;
学科 Engineering Robotics.;Computer Science.
学位 Ph.D.
年度 2007
页码 152 p.
总页数 152
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Semi-global leader-following coordination of multi-agent systems with input saturation and aperiodic intermittent communications [J] . Fan Zhipeng, Su Housheng, Che Shiming, Journal of the Franklin Institute . 2019 ,第2期

机译：具有输入饱和和非周期性间歇通信的多智能体系统的半全局领导者跟踪协调
2. Learning to Coordinate with Coordination Graphs in Repeated Single-Stage Multi-Agent Decision Problems [J] . Eugenio Bargiacchi, Timothy Verstraeten, Diederik Roijers, JMLR: Workshop and Conference Proceedings . 2018 ,第3期

机译：在重复的单阶段多智能体决策问题中学习与协调图协调
3. Introduction to the Special Issue of Group Decision and Negotiation 2002: Theory and Practice of Computational Coordination Mechanisms in Multi-Agent Systems [J] . PEYMAN FARATIN, NICHOLAS R. JENNINGS Group decision and negotiation . 2003 ,第5期

机译：2002年小组决策和谈判特刊简介：多智能体系统中的计算协调机制的理论与实践
4. Reasoning about joint beliefs for execution-time communication decisions [C] . Maayan Roth, Reid Simmons, Manuela Veloso International joint conference on Autonomous agents and multiagent systems . 2005

机译：关于执行时通信决策的联合信念的推理
5. Distributed coordination of multi-agent systems based on estimation over ad-hoc communication networks. [D] . Sun, Yashan. 2007

机译：基于自组织通信网络上的估计的多主体系统的分布式协调。
6. MARS a Multi-Agent System for Assessing Rowers Coordination via Motion-Based Stigmergy [O] . Marco Avvenuti, Daniel Cesarini, Mario G. C. A. Cimino 2013

机译：MARS一种多智能体系统可通过基于运动的Stigmergy评估划船者的协调能力
7. Reward Shaping for Valuing Communications During Multi-Agent Coordination [O] . Williamson Simon A., Gerding Enrico H., Jennings Nicholas R. 2009

机译：多agent协调期间评估通信的奖励
8. Communication Efficient Motion Coordination and Data Fusion in Information Gathering Teams. [R] . Kassir, A., Fitch, R., Sukkarieh, S. 2016

机译：信息收集小组中的通信高效运动协调与数据融合。

Execution-time communication decisions for coordination of multi-agent teams.

摘要

著录项

相似文献

相关主题

期刊订阅