Game theoretic control for robot teams.

机译：机器人团队的博弈论控制。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Planning for a decentralized team of robots is a fundamentally different problem from that of centralized control. During decision making, robots must take into account not only their own observations of world state, but also the possible observations and actions of teammates. While the interconnectedness of such a reasoning process seems to require an infinite recursion of beliefs to be modelled by each member of the team, game theory provides an alternative approach. Partially observable stochastic games (POSGs) generalize notions of single-stage games and Markov decision processes to both multiple agents and partially observable worlds. Even if there is only limited communication between teammates, POSGs allow robots to come up with policies that still take into account possible teammate experiences without the need to explicitly model any recursive beliefs about those experiences.; While a powerful model of decentralized teams, POSGs are computationally intractable for all but the smallest problems. This dissertation proposes a Bayesian game approximation to POSGs in which game-theoretic reasoning about action selection is retained, but robots reason only a limited time ahead about uncertainty in world state and the experiences of their teammates. Planning and execution are interleaved to further reduce computational burdens: at each timestep, robots perform a step of full game-theoretic reasoning about their current action selection given any possible history of observations and a heuristic evaluation of the expected future value of those decisions.; The Bayesian game approximation algorithm (BaGA) is able to find solutions to much larger problems than previously solved. Further computational savings are gained by reasoning about groups of similar observation histories rather than single histories. Finally, efficiency and performance are also improved through the use of run-time communication policies that trade off expected gains in performance with the costs of using bandwidth. In this dissertation, the performance of BaGA is compared to policies generated for full POSGs as well as heuristics. BaGA is also used to develop real-time robot controllers for a series of simulated and physical robotic tag problems that gradually increase in realism.

机译：规划分散的机器人团队与集中控制是根本不同的问题。在决策过程中，机器人不仅必须考虑自己对世界状态的观察，还应考虑队友的可能观察和行动。尽管这种推理过程的相互联系似乎需要团队的每个成员对信念进行无限递归，但是博弈论提供了一种替代方法。部分可观察的随机博弈（POSG）将单阶段博弈和马尔可夫决策过程的概念推广到多个代理和部分可观察的世界。即使队友之间的交流有限，POSG仍允许机器人提出仍考虑可能的队友经验的策略，而无需明确建模有关这些经验的任何递归信念。 POSG是强大的分散团队模型，除最小的问题外，在计算上都是棘手的。本文提出了一种对POSG的贝叶斯博弈近似方法，其中保留了关于动作选择的博弈论推理，但机器人仅在有限的时间内就世界状态的不确定性及其队友的经历进行了推理。计划和执行被交织在一起，以进一步减轻计算负担：在每个时间步，机器人都会根据其当前动作选择执行完整的游戏理论推理步骤，并给出观察的任何可能的历史记录，并对这些决策的预期未来价值进行启发式评估。贝叶斯博弈近似算法（BaGA）能够找到比以前解决的问题大得多的解决方案。通过推理类似观察历史的组而不是单个历史，可以进一步节省计算量。最后，通过使用运行时通信策略还可以提高效率和性能，该策略在性能预期收益与带宽使用成本之间进行权衡。本文将BaGA的性能与针对完整POSG以及启发式策略生成的策略进行比较。 BaGA还用于开发实时机器人控制器，以解决一系列现实中逐渐增加的模拟和物理机器人标签问题。

著录项

作者
Emery-Montemerlo, Rosemary.;
展开▼
作者单位

Carnegie Mellon University.;

展开▼
授予单位 Carnegie Mellon University.;
学科 Computer Science.
学位 Ph.D.
年度 2005
页码 253 p.
总页数 253
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Computer vision and pursuit–evasion game theoretical controls for ground robots [J] . Dan Shen, Haibin Ling, Khanh Pham, Advances in Mechanical Engineering . 2019,第8期

机译：地面机器人的计算机视觉和追逃游戏理论控制
2. Computer vision and pursuit–evasion game theoretical controls for ground robots [J] . Dan Shen, Haibin Ling, Khanh Pham, Advances in Mechanical Engineering . 2019,第8期

机译：地面机器人的计算机视觉和追逃游戏理论控制
3. Haptic Shared Control for Human-Robot Collaboration: A Game-Theoretical Approach [J] . Selma Musi?, Sandra Hirche IFAC PapersOnLine . 2020,第2期

机译：人体机器人协作的触觉共享控制：一种游戏理论方法
4. Holons and Agents in Robotic Teams. A Synergistic Approach [C] . Boldur BARBAT, Ciprian CANDEA, Constantin ZAMFIRESCU International NAISO(Natural amp; Artificial Intelligence Systems Organization) Congress on Information Science Innovations Mar 17-21, 2001 Dubai, U.A.E. . 2001

机译：机器人团队中的Holon和Agent。协同方法
5. Coordination Strategies for Human Supervisory Control of Robotic Teams. [D] . Peters, Jeffrey Russell. 2017

机译：机器人团队人员监督控制的协调策略。
6. Robotics and Gaming to Improve Ankle Strength Motor Control and Function in Children With Cerebral Palsy — A Case Study Series [O] . Grigore C. Burdea, Daniel Cioi, Angad Kale, -1

机译：机器人和游戏提高脚踝强度电机控制和脑瘫儿童的功能 - 案例研究系列
7. Computer vision and pursuit–evasion game theoretical controls for ground robots [O] . Dan Shen, Haibin Ling, Khanh Pham, 2019

机译：电脑愿景和追求逃避游戏的理论控制机器人
8. Behavioral Dynamics in the Cooperative Control of Mixed Human/Robotic Teams. [R] . Baillieul, J., Castanon, D., Cohen, J. D., 2015

机译：混合人/机器人团队协同控制中的行为动力学。

Game theoretic control for robot teams.

摘要

著录项

相似文献

相关主题

期刊订阅