首页> 外文会议>Annual conference on Neural Information Processing Systems >Trajectory-Based Short-Sighted Probabilistic Planning
【24h】

Trajectory-Based Short-Sighted Probabilistic Planning

机译:基于轨迹的短视概率计划

获取原文

摘要

Probabilistic planning captures the uncertainty of plan execution by probabilistically modeling the effects of actions in the environment, and therefore the probability of reaching different states from a given state and action. In order to compute a solution for a probabilistic planning problem, planners need to manage the uncertainty associated with the different paths from the initial state to a goal state. Several approaches to manage uncertainty were proposed, e.g., consider all paths at once, perform determinization of actions, and sampling. In this paper, we introduce trajectory-based short-sighted Stochastic Shortest Path Problems (SSPs), a novel approach to manage uncertainty for probabilistic planning problems in which states reachable with low probability are substituted by artificial goals that heuristically estimate their cost to reach a goal state. We also extend the theoretical results of Short-Sighted Probabilistic Planner (SSiPP) [1] by proving that SSiPP always finishes and is asymptotically optimal under sufficient conditions on the structure of short-sighted SSPs. We empirically compare SSiPP using trajectory-based short-sighted SSPs with the winners of the previous probabilistic planning competitions and other state-of-the-art planners in the triangle tireworld problems. Trajectory-based SSiPP outperforms all the competitors and is the only planner able to scale up to problem number 60, a problem in which the optimal solution contains approximately 10~(70) states.
机译:概率计划通过对环境中的动作效果进行概率模型化来捕获计划执行的不确定性,从而捕获从给定状态和动作中达到不同状态的概率。为了计算概率规划问题的解决方案,规划人员需要管理与从初始状态到目标状态的不同路径相关的不确定性。提出了几种管理不确定性的方法,例如,一次考虑所有路径,执行动作确定和采样。在本文中,我们介绍了基于轨迹的近视随机最短路径问题(SSP),这是一种管理概率规划问题不确定性的新方法,其中概率较低的状态被人工目标代替,这些目标通过启发式估计其达到目标的成本。目标状态。我们还证明了SSiPP始终完成并且在足够条件下对近视SSP的结构是渐近最优的,从而扩展了近视概率计划器(SSiPP)[1]的理论结果。我们根据经验将使用基于轨迹的近视SSP的SSiPP与以前的概率规划竞赛的获胜者以及其他在三角形轮胎世界问题中最先进的规划师进行比较。基于轨迹的SSiPP胜过所有竞争对手,并且是唯一能够扩展到第60个问题的计划者,该问题的最佳解决方案包含大约10〜(70)个状态。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号