首页> 外文学位 >Abstracting Influences for Efficient Multiagent Coordination Under Uncertainty.

【24h】

Abstracting Influences for Efficient Multiagent Coordination Under Uncertainty.

机译：不确定条件下有效多主体协作的抽象影响。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

When planning optimal decisions for teams of agents acting in uncertain domains, conventional methods explicitly coordinate all joint policy decisions and, in doing so, are inherently susceptible to the curse of dimensionality, as state, action, and observation spaces grow exponentially with the number of agents. With the goal of extending the scalability of optimal team coordination, the research presented in this dissertation examines how agents can reduce the amount of information they need to coordinate. Intuitively, to the extent that agents are weakly coupled, they can avoid the complexity of coordinating all decisions; they need instead only coordinate abstractions of their policies that convey their essential influences on each other.;In formalizing this intuition, I consider several complementary aspects of weakly-coupled problem structure, including agent scope size, corresponding to the number of an agent's peers whose decisions influence the agent's decisions, and degree of influence, corresponding to the proportion of unique influences that peers can feasibly exert. To exploit this structure, I introduce a (transition-dependent decentralized POMDP) model that efficiently decomposes into local decision models with shared state features. This context yields a novel characterization of influences as transition probabilities (compactly encoded using a dynamic Bayesian network). Not only is this influence representation provably sufficient for optimal coordination, but it also allows me to frame the subproblems of (1) proposing influences, (2) evaluating influences, and (3) computing optimal policies around influences as mixed-integer linear programs.;The primary advantage of working in the influence space is that there are potentially significantly fewer feasible influences than there are policies. Blending prior work on decoupled joint policy search and constraint optimization, I develop influence-space search algorithms that, for problems with a low degree of influence, compute optimal solutions orders of magnitude faster than policy-space search. When agents' influences are constrained, influence-space search also outperforms other state-of-the-art optimal solution algorithms. Moreover, by exploiting both degree of influence and agent scope size, I demonstrate scalability, substantially beyond the reach of prior optimal methods, to teams of 50 weakly-coupled transition-dependent agents.

机译：当为在不确定范围内行动的代理团队规划最佳决策时，常规方法会明确协调所有联合政策决策，并且这样做会固有地易受维度诅咒的影响，因为状态，行动和观察空间会随着决策数量的增长而成倍增长。代理商。为了扩展最佳团队协作的可扩展性，本文提出的研究研究了代理商如何减少他们需要协调的信息量。凭直觉，在主体之间耦合程度较弱的情况下，它们可以避免协调所有决策的复杂性。相反，他们只需要协调其政策的抽象性，以相互传达其实质性的影响。；在使这种直觉形式化时，我考虑了弱耦合问题结构的几个互补方面，包括与代理范围相对应的代理范围大小。决策会影响代理的决策和影响程度，与同伴可以切实发挥作用的独特影响的比例相对应。为了利用这种结构，我引入了一个（依赖于过渡的分散式POMDP）模型，该模型可以有效地分解为具有共享状态特征的局部决策模型。该上下文将影响作为转移概率（使用动态贝叶斯网络进行紧凑编码）进行了新颖的表征。这种影响表示不仅可以证明足以进行最佳协调，而且还使我能够对以下子问题进行框架化：（1）提出影响，（2）评估影响以及（3）将影响附近的最优策略作为混合整数线性程序进行计算。；在影响空间中工作的主要优势在于，与政策相比，可行的影响潜在地要少得多。结合先前在解耦联合策略搜索和约束优化方面的工作，我开发了影响空间搜索算法，该算法对于影响程度较低的问题，比策略空间搜索更快地计算了最佳解决方案。当代理商的影响受到限制时，影响空间搜索也会胜过其他最新的最佳解决方案算法。而且，通过同时利用影响程度和代理范围大小，我证明了对50个弱耦合依赖于过渡的代理团队的可扩展性，这是先前最佳方法无法实现的。

著录项

作者
Witwicki, Stefan J.;
展开▼
作者单位

University of Michigan.;

展开▼
授予单位 University of Michigan.;
学科 Operations Research.;Computer Science.;Artificial Intelligence.
学位 Ph.D.
年度 2011
页码 283 p.
总页数 283
原文格式 PDF
正文语种 eng
中图分类
关键词
入库时间 2022-08-17 11:44:52

相似文献

外文文献
中文文献
专利

1. Distributed Dynamic Reinforcement of Efficient Outcomes in Multiagent Coordination and Network Formation [J] . Georgios C. Chasparis, Jeff S. Shamma Dynamic games and applications . 2012,第1期

机译：分布式动态增强多Agent协调和网络形成中的有效结果
2. Distributed Dynamic Reinforcement of Efficient Outcomes in Multiagent Coordination and Network Formation [J] . Georgios C. Chasparis, Jeff S. Shamma Dynamic games and applications . 2012,第1期

机译：分布式动态增强多Agent协调和网络形成中的有效结果
3. Efficient and distributable methods for solving the multiagent plan coordination problem [J] . Jeffrey Cox, Edmund Durfee Multiagent and grid systems . 2009,第4期

机译：解决多主体计划协调问题的有效方法
4. Decentralized Multiagent Reinforcement Learning for Efficient Robotic Control by Coordination Graphs [C] . Chao Yu, Dongxu Wang, Jiankang Ren, Pacific Rim international conference on artificial intelligence . 2018

机译：通过协调图进行高效的机器人控制的分散式多主体强化学习
5. Abstract reasoning for multiagent coordination and planning. [D] . Clement, Bradley Jefferson. 2002

机译：多主体协调和计划的抽象推理。
6. Herd Those Sheep: Emergent Multiagent Coordination andBehavioral-Mode Switching [O] . Patrick Nalepka, Rachel W. Kallen, Anthony Chemero, -1

机译：放牧那些羊：紧急多代理协调和行为模式切换
7. Distributed dynamic reinforcement of efficient outcomes in multiagent coordination and network formation [O] . Georgios C. Chasparis, Jeff S. Shamma 2012

机译：分布式动态强化多主体协调和网络形成中的有效结果

Abstracting Influences for Efficient Multiagent Coordination Under Uncertainty.

摘要

著录项

相似文献

相关主题

期刊订阅