Graphical Models For Interactive Pomdps: Representations And Solutions

Prashant Doshi; Yifeng Zeng; Qiongyu Chen

首页> 外文期刊>Autonomous agents and multi-agent systems >Graphical Models For Interactive Pomdps: Representations And Solutions

【24h】

Graphical Models For Interactive Pomdps: Representations And Solutions

机译：交互式Pomdps的图形模型：表示法和解决方案

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We develop new graphical representations for the problem of sequential decision making in partially observable multiagent environments, as formalized by interactive partially observable Markov decision processes (I-POMDPs). The graphical models called interactive influence diagrams (I-IDs) and their dynamic counterparts, interactive dynamic influence diagrams (I-DIDs), seek to explicitly model the structure that is often present in real-world problems by decomposing the situation into chance and decision variables, and the dependencies between the variables. I-DIDs generalize DIDs, which may be viewed as graphical representations of POMDPs, to multiagent settings in the same way that I-POMDPs generalize POMDPs. I-DIDs may be used to compute the policy of an agent given its belief as the agent acts and observes in a setting that is populated by other interacting agents. Using several examples, we show how I-IDs and I-DIDs may be applied and demonstrate their usefulness. We also show how the models may be solved using the standard algorithms that are applicable to DIDs. Solving I-DIDs exactly involves knowing the solutions of possible models of the other agents. The space of models grows exponentially with the number of time steps. We present a method of solving I-DIDs approximately by limiting the number of other agents' candidate models at each time step to a constant. We do this by clustering models that are likely to be behaviorally equivalent and selecting a representative set from the clusters. We discuss the error bound of the approximation technique and demonstrate its empirical performance.

机译：我们针对部分可观察的多主体环境中的顺序决策问题开发了新的图形表示，通过交互式的部分可观察的马尔可夫决策过程（I-POMDP）形式化。称为交互式影响图（I-ID）及其动态对应物的图形模型，即交互式动态影响图（I-DID），试图通过将情况分解为机会和决策来明确地模拟现实世界中经常出现的结构。变量，以及变量之间的依赖关系。 I-DID将DID（可以视为POMDP的图形表示形式）推广到多代理程序设置，其方式与I-POMDP推广POMDP的方式相同。 I-DID可以用于计算代理的策略，前提是它相信代理在代理行为并在由其他交互代理填充的设置中进行观察时所遵循的策略。通过使用几个示例，我们展示了如何应用I-ID和I-DID并展示了它们的有用性。我们还展示了如何使用适用于DID的标准算法来求解模型。解决I-DID确实涉及了解其他代理的可能模型的解决方案。模型的空间随时间步长的增长呈指数增长。我们提出了一种通过将每个时间步骤中其他代理的候选模型的数量限制为一个常数来近似解决I-DID的方法。为此，我们对可能在行为上等效的模型进行聚类，然后从聚类中选择一个代表性的集合。我们讨论了近似技术的误差范围，并证明了其经验性能。

著录项

来源
《Autonomous agents and multi-agent systems》 |2009年第3期|p.376-416|共41页
作者
Prashant Doshi; Yifeng Zeng; Qiongyu Chen;
展开▼
作者单位

Department of Computer Science and Institute for AI, University of Georgia, Athens, GA 30602, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
probabilistic graphical models; interactive pomdps; sequential multiagent decision making;

机译：概率图形模型;交互式Pompps;顺序多主体决策;

相似文献

外文文献
中文文献
专利

1. A Study of the Graphical Representation of Plain-knitted Structures Part I: Stitch Model for the Graphical Representation of Plain-knitted Structures [J] . A. Demiroz, T. Dias The Journal of the Textile Institute. 1 . 2000,第4期

机译：平针织结构图形表示的研究第一部分：平针织结构图形表示的针迹模型
2. Interactive POMDPs with finite-state models of other agents [J] . Panella Alessandro, Gmytrasiewicz Piotr Autonomous agents and multi-agent systems . 2017,第4期

机译：具有其他代理的有限状态模型的交互式POMDP
3. Scalable solutions of interactive POMDPs using generalized and bounded policy iteration [J] . Ekhlas Sonu, Prashant Doshi Autonomous Agents and Multi-Agent Systems . 2015,第3期

机译：使用广义和有界策略迭代的交互式POMDP的可扩展解决方案
4. Graphical models for online solutions to interactive POMDPs [C] . Prashant Doshi, Yifeng Zeng, Qiongyu Chen, International joint conference on Autonomous agents and multiagent systems . 2007

机译：用于交互式POMDP的在线解决方案的图形模型
5. Automatic and interactive segmentations using deformable and graphical models [D] . Uzunbas, Mustafa Gokhan. 2015

机译：使用可变形和图形模型的自动和交互式分段
6. Modeling and Optimization of Manufacturing Process Performance using Modelica Graphical Representation and Process Analytics Formalism [O] . Guodong Shao, Alexander Brodsky, Ryan Miller -1

机译：使用Modelica图形表示和过程分析形式化对制造过程绩效进行建模和优化
7. Graphical Models for Interactive POMDPs:Representations and Solutions [O] . Doshi, Prashant, Zeng, Yifeng, Chen, Qiongyu 2009

机译：交互式pOmDp的图形模型：表示和解决方案
8. Graphical Representations and Causal Models in Intelligent Interactive LearningEnvironments [R] . Reiser, B. J. 1996

机译：智能交互式学习环境中的图形表示和因果模型

Graphical Models For Interactive Pomdps: Representations And Solutions

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅