首页> 外文学位 >Execution-time communication decisions for coordination of multi-agent teams.
【24h】

Execution-time communication decisions for coordination of multi-agent teams.

机译:执行时间沟通决策,以协调多主体团队。

获取原文
获取原文并翻译 | 示例

摘要

Although multi-agent teams provide additional functionality and robustness over single-agent systems, they also present additional challenges, mainly due to the difficulty of coordinating multiple agents in the presence of uncertainty and partial observability. Agents must reason about the collective state and behaviors of the team as well as uncertainty in their own environment.;In this thesis, we employ Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs), an extension of single-agent POMDPs that can be used to model and coordinate teams of agents. Although the problem of finding optimal policies for Dec-POMDPs is highly intractable, it is known that the presence of free communication transforms a multi-agent Dec-POMDP into a more tractable single-agent POMDP. We use this transformation to generate "centralized" policies for multi-agent teams modeled by Dec-POMDPs. We facilitate the decentralize execution of these centralized policies by providing algorithms that allow agents to reason about communication at execution-time. Our approach trades off the need to do some computation at execution-time for the ability to generate policies more tractably at plan-lime.;This thesis explores the question of how communication can be used effectively to enable the coordination of cooperative multi-agent teams making sequential decisions under uncertainty and partial observability. We identify two fundamental questions that must be answered when reasoning about communication: "When should agents communicate," and "What should agents communicate?" We present two basic approaches to enabling a team of distributed agents to avoid coordination errors, The first is an algorithm that reasons over the possible joint beliefs the team. We provide algorithms that address the questions of when and what agents should communicate.;The second approach presented in this thesis avoids coordination errors by creating individual factored policy for each agent. Factored policies provide a means for determining which state features agents should communicate, answering the questions of when and what agents should communicate. We use factored policies to identify instances of context-specific independence, in which agents can act without needing to consider the actions or observations of their teammates.
机译:尽管与单代理系统相比,多代理团队提供了更多的功能和鲁棒性,但它们也带来了其他挑战,这主要是由于在存在不确定性和部分可观察性的情况下很难协调多个代理。代理商必须对团队的集体状态和行为以及自身环境中的不确定性进行推理;在本论文中,我们采用分散的部分可观察的马尔可夫决策过程(Dec-POMDP),这是对单代理商POMDP的扩展。用于建模和协调代理商团队。尽管找到针对Dec-POMDP的最佳策略的问题非常棘手,但是众所周知,自由通信的存在将多代理Dec-POMDP转换为更易于处理的单代理POMDP。我们使用此转换为由Dec-POMDP建模的多代理团队生成“集中式”策略。通过提供允许代理在执行时推理通信的算法,我们促进了这些集中策略的分散执行。我们的方法权衡了在执行时需要进行一些计算的需求,以便能够在计划期限内更灵活地生成策略。;本文探讨了如何有效利用沟通来实现协作式多代理团队协调的问题在不确定性和部分可观察性下做出顺序决策。我们确定了在进行交流推理时必须回答的两个基本问题:“代理商应该何时交流”和“代理商应该交流什么?”。我们提出了两种基本方法来使分布式代理团队能够避免协调错误,第一种是一种算法,该算法会推理出团队可能的共同信念。我们提供了解决何时何人与代理人进行交流的问题的算法。本文提出的第二种方法是通过为每个代理人创建个性化策略来避免协调错误。析因策略提供了一种方法,可用来确定代理程序应传达哪些状态功能,并回答何时以及什么代理程序应传达的问题。我们使用分解策略来确定特定于上下文的独立性实例,在这种情况下,座席可以采取行动而无需考虑队友的行动或观察。

著录项

  • 作者

    Roth, Maayan.;

  • 作者单位

    Carnegie Mellon University.;

  • 授予单位 Carnegie Mellon University.;
  • 学科 Engineering Robotics.;Computer Science.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 152 p.
  • 总页数 152
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号