The Size of MDP Factored Policies

机译：MDP因素政策的规模

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Policies of Markov Decision Processes (MDPs) tell the next action to execute, given the current state and (possibly) the history of actions executed so far. Factorization is used when the number of states is exponentially large: both the MDP and the policy can be then represented using a compact form, for example employing circuits. We prove that there are MDPs whose optimal policies require exponential space even in factored form.

机译：Markov决策过程（MDPS）的策略告诉下一个执行的操作，给定当前状态和（可能）到目前为止执行的操作历史。当状态的数量是指数大的时，使用分解：然后可以使用紧凑的形式表示MDP和策略，例如采用电路。我们证明有MDP，其最佳政策即使是因子形式也需要指数空间。

著录项

来源
《National Conference on Artificial Intelligence》|2002年||共6页
会议地点
作者
Paolo Liberatore;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. On Polynomial Sized MDP Succinct Policies [J] . Liberatore P. The Journal of Artificial Intelligence Research . 2004,第12期

机译：关于多项式大小的MDP简洁策略
2. On Polynomial Sized MDP Succinct Policies [J] . Paolo Liberatore The Journal of Artificial Intelligence Research . 2004,第0期

机译：关于多项式大小的MDP简洁策略
3. Efficient approximate linear programming for factored MDPs [J] . Chen Feng, Cheng Qiang, Dong Jianwu, 高分子論文集 . 2015,第auga期

机译：分解式MDP的高效近似线性编程
4. The Size of MDP Factored Policies [C] . Paolo Liberatore National Conference on Artificial Intelligence . 2002

机译：MDP因素政策的规模
5. Active Cyber Deception and Attacker Intent Recognition Using Factored Interactive POMDPs [D] . Shinde, Aditya P. 2020

机译：有源网络欺骗和攻击者使用因子互动POMDPS的意图识别
6. MDPs with Non-Deterministic Policies [O] . Mahdi Milani Fard, Joelle Pineau -1

机译：具有不确定性策略的MDP
7. On Polynomial Sized MDP Succinct Policies [O] . Paolo Liberatore 2013

机译：关于多项式mDp简洁政策

The Size of MDP Factored Policies

摘要

著录项

相似文献

相关主题

期刊订阅