首页> 外文期刊>Knowledge-Based Systems >The hierarchical task network planning method based on Monte Carlo Tree Search
【24h】

The hierarchical task network planning method based on Monte Carlo Tree Search

机译:基于Monte Carlo树搜索的分层任务网络规划方法

获取原文
获取原文并翻译 | 示例
           

摘要

Since the hierarchical task network (HTN) planning depends on the domain knowledge of the problem, the planning result relies on the writing order of the decomposition method. Besides, the solution obtained by planning is usually a general feasible solution, which means there are shortcomings in the ability of finding the optimal solution. In order to reduce the dependence of HTN planning on domain knowledge and obtain a better planning solution, Pyhop-m, an HTN planning algorithm based on Monte Carlo Tree Search(MCTS) is proposed. In the planning process, a planning tree is built by MCTS to guide the HTN planner to choose the best decomposition method. Experiments illustrates that whether in the static or dynamic environment, Pyhop-m is superior to the existing Pyhop and heuristic-based Pyhop-h in plan length, planning success rate and optimal solution rate. Under the 95% confidence level, the confidence intervals of Pyhop-m algorithm to achieve the planning success rate and the optimal solution rate in the dynamic environment are [75.82%,89.18%] and [88.67%,93.95%], which are significantly higher than those of Pyhop-h with [58.19%,77.81%] and [69.91%,80.69%], respectively. Moreover, it can solve the planning problem with uncertain action executions by repeatedly simulating and evaluating the leaf nodes of the planning tree. It can be concluded that Pyhop-m can not only make the planning result independent of the writing order of the decomposition methods, but also search out the global optimal solution. (C) 2021 Elsevier B.V. All rights reserved.
机译:由于分层任务网络(HTN)规划取决于问题的域知识,因此规划结果依赖于分解方法的写入顺序。此外,通过规划获得的解决方案通常是一般可行的解决方案,这意味着寻找最佳解决方案的能力存在缺点。为了减少HTN规划对域知识的依赖性并获得更好的计划解决方案,PyHop-M,基于蒙特卡罗树搜索(MCTS)的HTN规划算法。在规划过程中,MCT建立了一个规划树,以指导HTN规划器选择最佳分解方法。实验说明是否在静态或动态环境中,PYHOP-M优于现有的PYHOP和基于启发式的PYHOP-H,规划成功率和最佳解决方案速率。在95%的置信水平下,PYHOP-M算法的置信区间实现了规划成功率和动态环境中的最佳解决方案率为[75.82%,89.18%]和[88.67%,93.95%]显着高于PyHop-H的[58.19%,77.81%]和[69.91%,80.69%]。此外,它可以通过重复模拟和评估规划树的叶节点来解决不确定行动执行的规划问题。可以得出结论,PyHop-M不仅可以与分解方法的写作顺序无关,还可以不仅使规划结果无关,而且还搜索全局最优解决方案。 (c)2021 elestvier b.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号