首页> 美国政府科技报告 >New Algorithms for Collaborative and Adversarial Decision Making in Partially Observable Stochastic Games

【24h】

New Algorithms for Collaborative and Adversarial Decision Making in Partially Observable Stochastic Games

机译：部分可观测随机游戏协同与对抗决策的新算法

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The project has produced new computational models and algorithms for coordination, prediction and planning in situations involving multiple decision makers that operate over an extended period of time in either collaborative or adversarial domains. This includes the development of the decentralized partially-observable Markov decision process (DEC-POMDP) model, memory-bounded algorithm for solving finite-horizon DEC-POMDPs, sparse representations of agent strategies using finite-state controllers, bounded policy iteration algorithms for infinite-horizon DEC-POMDPs, and algorithms for solving DEC- POMDPs using non-linear optimization methods. The project produced the best existing exact algorithms for these problems as well as scalable approximation techniques and benchmark problems that are now widely used within the multi- agent systems community. The report describes these research accomplishments and provides references to published papers and PhD dissertations that include detailed descriptions of the results.

著录项

作者
Zilberstein, S.;
展开▼
作者单位

展开▼
年度 2009
页码 p.1-11
总页数 11
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Decision making; Algorithms; Stochastic processes; Mathematical models; Theses; Parallel processing; Iterations; Markov processes; Game theory; Optimization; Uncertainty; Methodology; Policies;

机译：决策;算法;随机过程;数学模型;论文;并行处理;迭代;马尔可夫过程;博弈论;优化;不确定性;方法论;政策;

相似文献

外文文献
中文文献
专利

1. Delayed Reward-Based Genetic Algorithms for Partially Observable Markov Decision Problems [J] . Yoshihide Yamashiro, Atsushi Ueno, Hideaki Takeda Systems and Computers in Japan . 2004,第2期

机译：局部可观马尔可夫决策问题的基于延迟奖励的遗传算法
2. PALO bounds for reinforcement learning in partially observable stochastic games [J] . Ceren Roi, He Keyang, Doshi Prashant, Neurocomputing . 2021,第Jana8期

机译：Palo界限为部分可观察到的随机游戏中的加固学习
3. Optimizing honeypot strategies against dynamic lateral movement using partially observable stochastic games [J] . Horak Karel, Bosansky Branislav, Tomasek Petr, Computers & Security . 2019,第Nova期

机译：使用部分可观察的随机博弈优化蜜罐策略以防止动态横向移动
4. Using One-Sided Partially Observable Stochastic Games for Solving Zero-Sum Security Games with Sequential Attacks [C] . Petr Tomasek, Branislav Bosansky, Thanh H. Nguyen International Conference on Decision and Game Theory for Security . 2020

机译：使用单面部分可观察的随机游戏来解决零和安全游戏的顺序攻击
5. Increasing scalability in algorithms for centralized and decentralized partially observable Markov decision processes: Efficient decision-making and coordination in uncertain environments. [D] . Amato, Christopher. 2010

机译：用于集中式和分散式部分可观察的马尔可夫决策过程的算法中的可伸缩性不断增强：在不确定的环境中进行有效的决策和协调。
6. Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes [O] . Rajesh P. N. Rao 2010

机译：不确定性下的决策：基于部分可观察的马尔可夫决策过程的神经模型
7. Efficient On-the-Fly Algorithms for Partially Observable Timed Games [O] . Franck Cassez 2007

机译：局部可观察定时游戏的高效动态算法

New Algorithms for Collaborative and Adversarial Decision Making in Partially Observable Stochastic Games

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅