首页> 外文学位 >Game theoretic control for robot teams.
【24h】

Game theoretic control for robot teams.

机译:机器人团队的博弈论控制。

获取原文
获取原文并翻译 | 示例

摘要

Planning for a decentralized team of robots is a fundamentally different problem from that of centralized control. During decision making, robots must take into account not only their own observations of world state, but also the possible observations and actions of teammates. While the interconnectedness of such a reasoning process seems to require an infinite recursion of beliefs to be modelled by each member of the team, game theory provides an alternative approach. Partially observable stochastic games (POSGs) generalize notions of single-stage games and Markov decision processes to both multiple agents and partially observable worlds. Even if there is only limited communication between teammates, POSGs allow robots to come up with policies that still take into account possible teammate experiences without the need to explicitly model any recursive beliefs about those experiences.; While a powerful model of decentralized teams, POSGs are computationally intractable for all but the smallest problems. This dissertation proposes a Bayesian game approximation to POSGs in which game-theoretic reasoning about action selection is retained, but robots reason only a limited time ahead about uncertainty in world state and the experiences of their teammates. Planning and execution are interleaved to further reduce computational burdens: at each timestep, robots perform a step of full game-theoretic reasoning about their current action selection given any possible history of observations and a heuristic evaluation of the expected future value of those decisions.; The Bayesian game approximation algorithm (BaGA) is able to find solutions to much larger problems than previously solved. Further computational savings are gained by reasoning about groups of similar observation histories rather than single histories. Finally, efficiency and performance are also improved through the use of run-time communication policies that trade off expected gains in performance with the costs of using bandwidth. In this dissertation, the performance of BaGA is compared to policies generated for full POSGs as well as heuristics. BaGA is also used to develop real-time robot controllers for a series of simulated and physical robotic tag problems that gradually increase in realism.
机译:规划分散的机器人团队与集中控制是根本不同的问题。在决策过程中,机器人不仅必须考虑自己对世界状态的观察,还应考虑队友的可能观察和行动。尽管这种推理过程的相互联系似乎需要团队的每个成员对信念进行无限递归,但是博弈论提供了一种替代方法。部分可观察的随机博弈(POSG)将单阶段博弈和马尔可夫决策过程的概念推广到多个代理和部分可观察的世界。即使队友之间的交流有限,POSG仍允许机器人提出仍考虑可能的队友经验的策略,而无需明确建模有关这些经验的任何递归信念。 POSG是强大的分散团队模型,除最小的问题外,在计算上都是棘手的。本文提出了一种对POSG的贝叶斯博弈近似方法,其中保留了关于动作选择的博弈论推理,但机器人仅在有限的时间内就世界状态的不确定性及其队友的经历进行了推理。计划和执行被交织在一起,以进一步减轻计算负担:在每个时间步,机器人都会根据其当前动作选择执行完整的游戏理论推理步骤,并给出观察的任何可能的历史记录,并对这些决策的预期未来价值进行启发式评估。贝叶斯博弈近似算法(BaGA)能够找到比以前解决的问题大得多的解决方案。通过推理类似观察历史的组而不是单个历史,可以进一步节省计算量。最后,通过使用运行时通信策略还可以提高效率和性能,该策略在性能预期收益与带宽使用成本之间进行权衡。本文将BaGA的性能与针对完整POSG以及启发式策略生成的策略进行比较。 BaGA还用于开发实时机器人控制器,以解决一系列现实中逐渐增加的模拟和物理机器人标签问题。

著录项

  • 作者

    Emery-Montemerlo, Rosemary.;

  • 作者单位

    Carnegie Mellon University.;

  • 授予单位 Carnegie Mellon University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 253 p.
  • 总页数 253
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号