Solving Factored MDPs via Non-Homogeneous Partitioning

机译：通过非均匀分区解因式MDP

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes an algorithm for solving large state-space MDPs (represented as factored MDPs) using search by successive refinement in the space of non-homogeneous partitions. Homogeneity is defined in terms of bisimulation and reward equivalence within blocks of a partition. Since homogeneous partitions that define equivalent reduced state-space MDPs can have a large number of blocks, we relax the requirement of homogeneity. The algorithm constructs approximate aggregate MDPs from non-homogeneous partitions, solves the aggregate MDPs exactly, and then uses the resulting value functions as part of a heuristic in refining the current best non-homogeneous partition. We outline the theory motivating the use of this heuristic and present empirical results and comparisons.

机译：本文介绍了一种算法，该算法通过在非均匀分区空间中进行逐次细化来搜索来解决大型状态空间MDP（表示为分解MDP）。均质性是根据分区块内的双仿真和奖励等效性来定义的。由于定义等效缩减状态空间MDP的同质分区可以具有大量块，因此我们放宽了对同质性的要求。该算法从非均匀分区构造近似的聚合MDP，精确求解聚合MDP，然后将结果值函数用作启发式算法的一部分，以细化当前最佳的非均匀分区。我们概述了激发这种启发式方法使用的理论，并提出了实证结果和比较结果。

著录项

来源
《Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-01) Vol.1, Aug 4-10, 2001, Seattle, Washington》|2001年|p.683-689|共7页
会议地点
作者
Kee-Eung Kim; Thomas Dean;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词
入库时间 2022-08-26 14:07:17

相似文献

外文文献
中文文献
专利

1. Solving factored MDPs using non-homogeneous partitions [J] . Kee-Eung Kim, Thomas Dean Artificial intelligence . 2003,第1a2期

机译：使用非均匀分区求解分解式MDP
2. Solving Factored MDPs with Hybrid State and Action Variables [J] . Guestrin C., Hauskrecht M., Kveton B. The Journal of Artificial Intelligence Research . 2006,第5期

机译：用混合状态和动作变量求解分解式MDP
3. Solving Factored MDPs with Hybrid State and Action Variables [J] . B. Kveton, M. Hauskrecht, C. Guestrin Journal of Automation, Mobile Robotics & Intelligent Systems . 2006,第5期

机译：用混合状态和动作变量求解分解式MDP
4. Solving Factored MDPs via Non-Homogeneous Partitioning [C] . Kee-Eung Kim, Thomas Dean International Joint Conference on Artificial Intelligence . 2007

机译：通过非同质分区解决因子MDP
5. Point-Based POMDP Solvers: Survey and Comparative Analysis. [D] . Kaplow, Robert. 2010

机译：基于点的POMDP解决方案：调查和比较分析。
6. Partially non-homogeneous dynamic Bayesian networks based on Bayesian regression models with partitioned design matrices [O] . Mahdi Shafiee Kamalabad, Alexander Martin Heberle, Kathrin Thedieck, -1

机译：基于带划分设计矩阵的贝叶斯回归模型的部分非均匀动态贝叶斯网络
7. Solving factored MDPs using non-homogeneous partitions [O] . Kim Kee-Eung, Dean Thomas 2003

机译：使用非均匀分区求解分解式MDP

Solving Factored MDPs via Non-Homogeneous Partitioning

摘要

著录项

相似文献

相关主题

期刊订阅