
Learning Methods to Generate Good Plans: Integrating HTN Learning and Reinforcement Learning



Abstract

We consider how to learn Hierarchical Task Networks (HTNs) for planning problems in which both the quality of the solution plans generated by the HTNs and the speed at which those plans are found are important. We describe an integration of HTN Learning with Reinforcement Learning that learns methods by analyzing semantic annotations on tasks and produces estimates of the expected values of the learned methods by performing Monte Carlo updates. We performed an experiment in which plan quality was inversely related to plan length. In two planning domains, we evaluated the planning performance of the learned methods against two state-of-the-art satisficing classical planners, FastForward and SGPlan6, and one optimal planner, HSP*_F. The results demonstrate that a greedy HTN planner using the learned methods generated higher-quality solutions than SGPlan6 in both domains and than FastForward in one. Our planner, FastForward, and SGPlan6 ran in similar time, while HSP*_F was exponentially slower.
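The abstract describes two mechanisms: Monte Carlo updates that estimate the expected value of each learned method, and a greedy HTN planner that prefers the method with the highest estimate. The sketch below illustrates those two pieces in Python, under the abstract's stated assumption that plan quality is inversely related to plan length (so the negated plan length serves as the episode return). All names and the toy two-method domain are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch only: Monte Carlo value estimation for HTN methods
# and greedy method selection, as described in the abstract. The names
# (MethodStats, mc_update, greedy_select) and the toy domain are assumed.
import random
from collections import defaultdict

class MethodStats:
    """Running Monte Carlo estimate of a method's expected plan quality."""
    def __init__(self):
        self.value = 0.0   # estimated expected return
        self.count = 0     # number of sampled episodes

def mc_update(stats: MethodStats, episode_return: float) -> None:
    # Incremental every-visit Monte Carlo average: V <- V + (G - V) / n
    stats.count += 1
    stats.value += (episode_return - stats.value) / stats.count

def greedy_select(candidates, stats_table):
    # Greedy HTN planning step: apply the method with the highest
    # estimated expected value.
    return max(candidates, key=lambda m: stats_table[m].value)

# Plan quality is inversely related to plan length, so the negated plan
# length can serve as the episode return during learning.
stats_table = defaultdict(MethodStats)
for _ in range(100):
    method = random.choice(["deliver-direct", "deliver-via-hub"])
    plan_length = {"deliver-direct": 4, "deliver-via-hub": 7}[method]
    mc_update(stats_table[method], -plan_length)

best = greedy_select(["deliver-direct", "deliver-via-hub"], stats_table)
print(best)  # -> "deliver-direct" (shorter plans, higher quality)
```

The incremental update avoids storing full episode histories: each method keeps only a running mean and a visit count, which is what lets a greedy planner consult the estimates cheaply at decomposition time.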
