Coevolutive Planning in Markov Decision Processes

机译：马尔可夫决策过程中的协同进化规划

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We investigate the idea of having groups of agents coevolv-ing in order to iteratively refine multi-agent plans. This idea we called coevolution is formalized and analyzed in a general purpose and applied to the stochastic control frameworks that use an explicit model of the world: coevolution can directly be adapted to the frameworks of Multi-Agent Markov Decision Processes (MMDP) and Multi-Agent Partially Observable MDP (MPOMDP). We also consider the decentralized version of MPOMDP (DEC-POMDP) which is known to be a difficult problem : we show that the coevolution approach can be applied if we restrict the search to memoryless policies. We evaluate our coevolutive approach experimentally on a typical multi-agent problem.

机译：我们研究了让代理商群体协同发展的想法，以迭代地完善多代理商计划。我们称之为协同进化的想法已在一般用途中进行了形式化和分析，并应用于使用世界显式模型的随机控制框架：协同进化可直接适用于多代理马尔可夫决策过程（MMDP）和多目标代理部分可观察的MDP（MPOMDP）。我们还考虑了MPOMDP的分散版本（DEC-POMDP），这是一个困难的问题：我们表明，如果我们将搜索限制在无内存策略中，则可以应用协同进化方法。我们在典型的多主体问题上通过实验评估了协同进化方法。

著录项

来源
《First International Joint Conference on Autonomous Agents and Multiagent Systems Pt.2, Jul 15-19, 2002, Bologna, Italy》|2002年|p.843-844|共2页
会议地点 Bologna(IT);Bologna(IT);Bologna(IT);Bologna(IT);Bologna(IT);Bologna(IT);Bologna(IT);Bologna(IT)
作者
Bruno Scherrer; Francois Charpillet;
展开▼
作者单位

LORIA, Campus Scientifique BP239 -F54506 Vandoeuvre-Ies-Nancy;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
action selection and planning; coordinating multi-agent activites; evolution adaptation and learning; markov decision processes;

机译：行动选择和计划；协调多主体和活动；进化适应和学习；马可夫决策过程;

相似文献

外文文献
中文文献
专利

1. Optimal decisions for continuous time Markov decision processes overn finite planning horizons [J] . Buchholz Peter, Dohndorf Iryna, Scheftelowitsch Dimitri Computers & operations research . 2017,第jana期

机译：有限规划范围内连续时间马尔可夫决策过程的最优决策
2. Planning using hierarchical constrained Markov decision processes [J] . Feyzabadi Seyedshams, Carpin Stefano Autonomous robots . 2017,第8期

机译：使用分层约束的马尔可夫决策过程计划
3. Ground Delay Program Planning Using Markov Decision Processes [J] . Cox Jonathan, Kochenderfer Mykel J. Journal of Aerospace Computing, Information, and Communication . 2016,第3期

机译：使用马尔可夫决策过程的地面延迟程序计划
4. Coevolutive planning in markov decision processes [C] . Bruno Scherrer, Francois Charpillet International joint conference on Autonomous agents and multiagent systems . 2002

机译：马可夫决策过程中的协同进化规划
5. Robot Planning with Constrained Markov Decision Processes [D] . Feyzabadi, Seyedshams. 2017

机译：约束马尔可夫决策过程的机器人规划
6. Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes [O] . Rajesh P. N. Rao 2010

机译：不确定性下的决策：基于部分可观察的马尔可夫决策过程的神经模型
7. A Variational Perturbative Approach to Planning in Graph-Based Markov Decision Processes [O] . Dominik Linzner, Heinz Koeppl 2020

机译：基于图形的马尔可夫决策过程规划的变分刺痛方法
8. Two Short Notes on Markov Processes: I. A Test for Sub-Optimal Actions in Markovian Decision Problems. II. An Intrinsically Determined Markov Chain [R] . MacQueen, J. B. 1966

机译：关于马尔可夫过程的两个简短说明：I。马尔可夫决策问题中次优最优行动的检验。 II。本质上确定的马尔可夫链

Coevolutive Planning in Markov Decision Processes

摘要

著录项

相似文献

相关主题

期刊订阅