【24h】

Three New Algorithms to Solve N-POMDPs

机译:三个新算法来解决n-pomdps

获取原文

摘要

In many fields in computational sustainability, applications of POMDPs are inhibited by the complexity of the optimal solution. One way of delivering simple solutions is to represent the policy with a small number of α-vectors. We would like to find the best possible policy that can be expressed using a fixed number N of α-vectors. We call this the N-POMDP problem. The existing solver α-min approximately solves finite-horizon POMDPs with a controllable number of α-vectors. However α-min is a greedy algorithm without performance guarantees, and it is rather slow. This paper proposes three new algorithms, based on a general approach that we call α-min-2. These three algorithms are able to approximately solve N-POMDPs. α-min-2-fast (heuristic) and α-min-2-p (with performance guarantees) are designed to complement an existing POMDP solver, while α-min-2-solve (heuristic) is a solver itself. Complexity results are provided for each of the algorithms, and they are tested on well-known benchmarks. These new algorithms will help users to interpret solutions to POMDP problems in computational sustainability.
机译:在计算可持续性中的许多领域中,POMDP的应用受到最佳解决方案的复杂性的抑制。提供简单解决方案的一种方法是表示具有少量α-载体的策略。我们想找到可以使用固定数N的α - 向量表达的最佳策略。我们称之为N-POMDP问题。现有求解器α-min近似求解有限地平线POMDP,具有可控数量的α-载体。然而,α-min是一种没有性能保证的贪婪算法,它相当慢。本文提出了三种新的算法,基于我们称之为α-min-2的一般方法。这三种算法能够大致解决n-pomdps。 α-min-2 - 快(启发式)和α-min-2-p(具有性能保证)旨在补充现有的POMDP求解器,而α-min-2求解(启发式)是一个求解器本身。为每个算法提供复杂性结果,并在众所周知的基准测试中进行测试。这些新算法将帮助用户解释对计算可持续性的POMDP问题的解释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号