Combining Offline and Online Computation for Solving Partially Observable Markov Decision Process.


Abstract

The partially observable Markov decision process (POMDP) provides a general and mathematically elegant way of formulating planning and control problems under uncertainty. Unfortunately, POMDPs are computationally intractable to solve in the worst case, which has prompted the development of approximation algorithms. In this project, the authors explored the use of online algorithms for approximately solving large-scale POMDPs. They developed a new online POMDP solver, DESPOT, with good theoretical and practical properties. DESPOT was used as part of the authors' entry that finished in first place in the POMDP track of the ICAPS 2014 International Probabilistic Planning Competition (IPPC). They also applied DESPOT to the problem of autonomous vehicle navigation through crowded locations, demonstrating its use in a real application. Although POMDPs are intractable in the worst case, some subclasses can be tractably approximated and are at the same time of practical interest. The authors applied online methods to two such special cases, adaptive informative path planning and active learning, obtaining practical polynomial-time algorithms with guaranteed approximation bounds.

Bibliographic details

Author
Lee, W. S.;
Year 2015
Pages 1-11
Total pages 11
Original format PDF
Language of text English
CLC classification Industrial technology
Keywords
Adaptive control systems; Decision making; Markov processes; Algorithms; Approximation (mathematics); Autonomous navigation; Bayes theorem; Computerized simulation; Experimental design; Learning machines; Multisensors; Online systems; Optimization; Paths; Pattern recognition; Polynomials; Position (location); Probability distribution functions; Robotics; Uncertainty; Vehicles; POMDP (partially observable Markov decision process); Online decision making; Belief tree; Sampling; Active learning; Path finding; Autonomous vehicle navigation; Reinforcement learning; PE61102F

Similar literature

1. Parallel rollout for online solution of partially observable Markov decision processes [J]. Chang HS, Givan R, Chong EKP. Discrete Event Dynamic Systems: Theory and Applications. 2004, No. 3

2. The Optimal Observability of Partially Observable Markov Decision Processes: Discrete State Space [J]. Rezaeian M., Vo B.-N., Evans J. S. IEEE Transactions on Automatic Control. 2010, No. 12

3. Monotonicity properties for two-action partially observable Markov decision processes on partially ordered spaces [J]. European Journal of Operational Research. 2020, No. 3

4. Online Active Perception for Partially Observable Markov Decision Processes with Limited Budget [C]. Mahsa Ghasemi, Ufuk Topcu. IEEE Annual Conference on Decision and Control. 2019

5. Exploiting structure to efficiently solve large scale partially observable Markov decision processes [D]. Poupart, Pascal. 2005

6. Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes [O]. Rajesh P. N. Rao. 2010

7. Assisting persons with dementia during handwashing using a partially observable Markov decision process [O]. Hoey Jesse, Bertoldi Axel von, Poupart Pascal. 2007
