International Joint Conference on Artificial Intelligence

Optimally Solving Dec-POMDPs as Continuous-State MDPs

Abstract

Optimally solving decentralized partially observable Markov decision processes (Dec-POMDPs) is a hard combinatorial problem. Current algorithms search through the space of full histories for each agent. Because of the doubly exponential growth in the number of policies in this space as the planning horizon increases, these methods quickly become intractable. However, in real-world problems, computing policies over the full history space is often unnecessary. True histories experienced by the agents often lie near a structured, low-dimensional manifold embedded into the history space. We show that by transforming a Dec-POMDP into a continuous-state MDP, we are able to find and exploit these low-dimensional representations. Using this novel transformation, we can then apply powerful techniques for solving POMDPs and continuous-state MDPs. By combining a general search algorithm and dimension reduction based on feature selection, we introduce a novel approach to optimally solve problems with significantly longer planning horizons than previous methods.
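As a rough illustration of the reformulation described in the abstract (a sketch under assumptions, not code or notation from the paper): the distribution over pairs of hidden state and joint history, often called an occupancy state, can serve as the continuous state of the equivalent MDP, while a joint decision rule mapping histories to joint actions serves as the MDP action. The function name step_occupancy, the dictionary-based model encoding, and the toy example are all hypothetical.

# Illustrative sketch (not the paper's code): one forward step of a Dec-POMDP
# viewed as a continuous-state MDP over occupancy states.
from collections import defaultdict

def step_occupancy(eta, decision_rule, T, O, R):
    """Advance the occupancy state by one joint decision rule.

    eta           : {(s, hist): prob}       current occupancy state
    decision_rule : {hist: joint_action}    the MDP "action"
    T             : {(s, a): {s_next: prob}}     hidden-state transitions
    O             : {(a, s_next): {z: prob}}     joint-observation model
    R             : {(s, a): reward}             immediate rewards

    Returns (expected_reward, next_eta); each history grows by one (a, z) pair.
    """
    expected_reward = 0.0
    next_eta = defaultdict(float)
    for (s, hist), p in eta.items():
        a = decision_rule[hist]
        expected_reward += p * R.get((s, a), 0.0)
        for s_next, p_t in T[(s, a)].items():
            for z, p_o in O[(a, s_next)].items():
                next_eta[(s_next, hist + ((a, z),))] += p * p_t * p_o
    return expected_reward, dict(next_eta)

# Toy usage with a single state, action, and observation (purely illustrative):
eta0 = {("s0", ()): 1.0}
rule = {(): "a0"}
T = {("s0", "a0"): {"s0": 1.0}}
O = {("a0", "s0"): {"z0": 1.0}}
R = {("s0", "a0"): 1.0}
reward, eta1 = step_occupancy(eta0, rule, T, O, R)
# reward == 1.0 and eta1 == {("s0", (("a0", "z0"),)): 1.0}

The sketch only shows why the MDP view is possible: given a joint decision rule, the next occupancy state is a deterministic function of the current one. The search algorithm and the feature-selection-based dimension reduction over histories that the abstract mentions are not shown here.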
