首页> 美国政府科技报告 >Combining Offline and Online Computation for Solving Partially Observable Markov Decision Process.
【24h】

Combining Offline and Online Computation for Solving Partially Observable Markov Decision Process.

机译:结合离线和在线计算求解部分可观测马尔可夫决策过程。

获取原文

摘要

Partially observable Markov decision process (POMDP) provides a general and mathematically elegant way of formulating planning and control problems under uncertainty. Unfortunately, POMDPs are computationally intractable to solve in the worst case, prompting the development of approximation algorithms. In this project, they explored the use of online algorithms for approximately solving large-scale POMDPs. They developed a new online POMDP solver, DESPOT, with good theoretical and practical properties. The DESPOT algorithm was used as part of our entry that finished in first place at the ICAPS 2014 International Probabilistic Planning Competition (IPPC) POMDP track. They also applied the DESPOT algorithm on the problem of autonomous vehicle navigation through crowded locations, demonstrating its use in a real application. Although POMDPs are intractable in the worst case, there are subclasses of POMDPs that can be tractably approximated and are at the same time practically interesting. They applied online methods to two such special cases of POMDPs, specifically adaptive informative path planning and active learning, obtaining practical polynomial-time algorithms with guaranteed approximation bounds.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号