【24h】

Solving Factored MDPs via Non-Homogeneous Partitioning

机译:通过非均匀分区解因式MDP

获取原文

摘要

This paper describes an algorithm for solving large state-space MDPs (represented as factored MDPs) using search by successive refinement in the space of non-homogeneous partitions. Homogeneity is defined in terms of bisimulation and reward equivalence within blocks of a partition. Since homogeneous partitions that define equivalent reduced state-space MDPs can have a large number of blocks, we relax the requirement of homogeneity. The algorithm constructs approximate aggregate MDPs from non-homogeneous partitions, solves the aggregate MDPs exactly, and then uses the resulting value functions as part of a heuristic in refining the current best non-homogeneous partition. We outline the theory motivating the use of this heuristic and present empirical results and comparisons.
机译:本文介绍了一种算法,该算法通过在非均匀分区空间中进行逐次细化来搜索来解决大型状态空间MDP(表示为分解MDP)。均质性是根据分区块内的双仿真和奖励等效性来定义的。由于定义等效缩减状态空间MDP的同质分区可以具有大量块,因此我们放宽了对同质性的要求。该算法从非均匀分区构造近似的聚合MDP,精确求解聚合MDP,然后将结果值函数用作启发式算法的一部分,以细化当前最佳的非均匀分区。我们概述了激发这种启发式方法使用的理论,并提出了实证结果和比较结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号