An MDP-based policy for stochastic multi-agent domains

机译：基于MDP的随机多算子域的策略

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Stochastic environments pose challenging problems for agents trying to act optimally in the presence of other agents. In such environments agents have to contend with the probabilistic effects of other agents' actions, their inability tocompletely observe the state of the world before selecting the next action and in some cases the high cost of communication. In this paper, we show how such systems can be modeled as multi-agent Markov decison processes. We describe a policy thatprescribes an action that has a high probability of being the optimal action under a given global state distribution and present an algorithm that agents can use to act in such environments while attempting to achieve their goals.

机译：随机环境对试图在其他代理商的存在下最佳行动的代理构成具有挑战性的问题。在这种环境中，代理人必须抗争解其他代理人的行动的概率影响，在选择下一个行动之前，他们无法遵守世界的国家，并且在某些情况下沟通的高成本。在本文中，我们展示了这种系统如何建模为多代理马尔可夫甲板流程。我们描述了一项规范，该政策专项是在给定的全局状态分布下的最佳行动的概率高概率的策略，并且在尝试实现目标时，代理可以使用代理商可以用于在这种环境中起作用的算法。

著录项

来源
《IEEE International Conference on System, Man, and Cybernetics》|1999年||共5页
会议地点
作者
Pradeep M. Pappachan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP27-53;
关键词

相似文献

外文文献
中文文献
专利

1. Exploiting symmetries for single- and multi-agent Partially Observable Stochastic Domains [J] . Byung Kon Kang, Kee-Eung Kim Artificial intelligence . 2012,第期

机译：利用单主体和多主体部分可观察的随机域的对称性
2. A multi-agent reinforcement learning approach to obtaining dynamic control policies for stochastic lot scheduling problem [J] . Paternina-Arboleda CD, Das TK Simulation modelling practice and theory: International journal of the Federation of European Simulation Societies . 2005,第5期

机译：一种用于随机批次调度问题的动态控制策略的多主体强化学习方法
3. MDP-based handover policy in wireless relay systems [J] . Xiaoyu Dang, Jin-Yuan Wang, Zhe Cao Eurasip Journal on Wireless Communications and Networking . 2012,第1期

机译：无线中继系统中基于MDP的切换策略
4. An MDP-based policy for stochastic multi-agent domains [C] . Pappachan, P.M. . 1999

机译：随机多代理域的基于MDP的策略
5. Exploiting Stochasticity in Multi-agent Systems. [D] . Mesquita, Alexandre Rodrigues. 2010

机译：在多主体系统中利用随机性。
6. Advancing a health equity agenda across multiple policy domains: a qualitative policy analysis of social trade and welfare policy [O] . Belinda Townsend, Sharon Friel, Toby Freeman, 2020

机译：跨越多项政策领域推进健康股权议程：社会贸易和福利政策的定性政策分析
7. Exploiting symmetries for single- and multi-agent Partially Observable Stochastic Domains [O] . Kang Byung Kon, Kim Kee-Eung 2012

机译：为单主体和多主体部分可观察的随机域开发对称性
8. Application of Fuzzy State Aggregation and Policy Hill Climbing to Multi-Agent Systems in Stochastic Environments [R] . Wardell, D. C. 2006

机译：模糊状态聚合和策略爬坡在随机环境下多agent系统中的应用

An MDP-based policy for stochastic multi-agent domains

摘要

著录项

相似文献

相关主题

期刊订阅