Three New Algorithms to Solve N-POMDPs



Abstract

In many fields of computational sustainability, applications of POMDPs are inhibited by the complexity of the optimal solution. One way of delivering simple solutions is to represent the policy with a small number of α-vectors. We would like to find the best possible policy that can be expressed using a fixed number N of α-vectors. We call this the N-POMDP problem. The existing solver α-min approximately solves finite-horizon POMDPs with a controllable number of α-vectors. However, α-min is a greedy algorithm without performance guarantees, and it is rather slow. This paper proposes three new algorithms, based on a general approach that we call α-min-2. These three algorithms are able to approximately solve N-POMDPs. α-min-2-fast (heuristic) and α-min-2-p (with performance guarantees) are designed to complement an existing POMDP solver, while α-min-2-solve (heuristic) is a solver itself. Complexity results are provided for each of the algorithms, and they are tested on well-known benchmarks. These new algorithms will help users to interpret solutions to POMDP problems in computational sustainability.
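
To illustrate what "a policy expressed using a fixed number N of α-vectors" means, here is a minimal sketch. It assumes the standard piecewise-linear representation in which the value of a belief b is the maximum of α·b over the vectors, and in which each α-vector is tagged with the action it recommends. The type alias, function name, and toy numbers are hypothetical and are not taken from the paper; this is neither α-min nor any of the α-min-2 algorithms, only the policy representation they operate on.

```python
from typing import List, Sequence, Tuple

# Hypothetical representation: each alpha-vector holds one value per state
# and is paired with the action it recommends.
AlphaVector = Tuple[str, Sequence[float]]


def value_and_action(belief: Sequence[float],
                     alphas: List[AlphaVector]) -> Tuple[float, str]:
    """Return (V(b), action), where V(b) is the max over alpha of alpha . b."""
    best_value, best_action = float("-inf"), ""
    for action, vec in alphas:
        # Dot product of the belief with this alpha-vector.
        value = sum(b * a for b, a in zip(belief, vec))
        if value > best_value:
            best_value, best_action = value, action
    return best_value, best_action


# Toy two-state problem with N = 2 alpha-vectors (made-up numbers).
alphas = [("wait", [1.0, 0.0]), ("act", [0.0, 2.0])]
value, action = value_and_action([0.5, 0.5], alphas)
```

With only N vectors, the policy can be read as N linear rules over the belief, which is what makes a small N easier to interpret.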


Bibliographic details

Source
AAAI Conference on Artificial Intelligence; Innovative Applications of Artificial Intelligence Conference; Symposium on Educational Advances in Artificial Intelligence | 2017 | pp. 4348-5137 | 7 pages
Authors
Yann Dujardin; Tom Dietterich; Iadine Chades
Original format
PDF
Chinese Library Classification
TP18-53

Similar literature


1. Solving Large Problem Sizes of Index-Digit Algorithms on GPU: FFT and Tridiagonal System Solvers [J]. Adrián Pérez Diéguez, Margarita Amor, Jacobo Lobeiras, IEEE Transactions on Computers. 2018, No. 1

2. A Class of Generalized Approximate Inverse Solvers for Unsymmetric Linear Systems of Irregular Structure Based on Adaptive Algorithmic Modelling for Solving Complex Computational Problems in Three Space Dimensions [J]. Anastasia-Dimitra Lipitakis, Applied Mathematics. 2016, No. 11

3. Three New Algorithms to Solve N-POMDPs [C]. Yann Dujardin, Tom Dietterich, Iadine Chades, AAAI Conference on Artificial Intelligence. 2017

4. The Effects of Mastery of Editing Peers' Written Math Algorithms on Producing Effective Problem Solving Algorithms. [D]. Weber, Jennifer Danielle. 2016

5. Accuracy of compact-stencil interpolation algorithms for unstructured mesh finite volume solver [O]. Adek Tasri, Anita Susilawati. 2021

6. A Survey of Solving SVP Algorithms and Recent Strategies for Solving the SVP Challenge [O]. Masaya Yasuda. 2020
