Home > Foreign Conference Proceedings > AAAI Conference on Artificial Intelligence > Learning Methods to Generate Good Plans: Integrating HTN Learning and Reinforcement Learning

Learning Methods to Generate Good Plans: Integrating HTN Learning and Reinforcement Learning



Abstract

We consider how to learn Hierarchical Task Networks (HTNs) for planning problems in which both the quality of the solution plans generated by the HTNs and the speed at which those plans are found are important. We describe an integration of HTN Learning with Reinforcement Learning that both learns methods by analyzing semantic annotations on tasks and produces estimates of the expected values of the learned methods by performing Monte Carlo updates. We performed an experiment in which plan quality was inversely related to plan length. In two planning domains, we evaluated the planning performance of the learned methods against two state-of-the-art satisficing classical planners, FASTFORWARD and SGPLAN6, and one optimal planner, HSP*F. The results demonstrate that a greedy HTN planner using the learned methods generated higher-quality solutions than SGPLAN6 in both domains and than FASTFORWARD in one. Our planner, FASTFORWARD, and SGPLAN6 ran in similar time, while HSP*F was exponentially slower.
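The combination the abstract describes, estimating expected values of learned HTN methods via Monte Carlo updates and then selecting methods greedily, can be illustrated with a minimal sketch. This is not the paper's implementation; the class name, the every-visit update scheme, and the use of negative plan length as the return are illustrative assumptions consistent with the abstract's statement that plan quality was inversely related to plan length.

```python
# Hedged sketch: Monte Carlo value estimates for learned HTN methods.
# Assumption: an episode's return is the negative of the resulting
# plan's length, so shorter plans score higher.
from collections import defaultdict

class MethodValueTable:
    """Running-average Monte Carlo estimates of expected plan quality
    for each learned method (identified here by a string name)."""

    def __init__(self):
        self.value = defaultdict(float)  # estimated expected return per method
        self.count = defaultdict(int)    # number of episodes observed per method

    def update(self, episode_methods, plan_length):
        """Every-visit Monte Carlo update: each method used in the
        episode receives the episode's return."""
        ret = -plan_length
        for m in episode_methods:
            self.count[m] += 1
            # Incremental mean: V <- V + (ret - V) / n
            self.value[m] += (ret - self.value[m]) / self.count[m]

    def best_method(self, candidates):
        """Greedy selection among the methods applicable to a task."""
        return max(candidates, key=lambda m: self.value[m])

table = MethodValueTable()
table.update(["m1", "m2"], plan_length=10)  # both methods receive return -10
table.update(["m1"], plan_length=4)         # m1's average becomes (-10 + -4) / 2 = -7
best = table.best_method(["m1", "m2"])      # greedy choice is m1 (-7 > -10)
```

A greedy HTN planner in this style would call `best_method` at each task decomposition, which is one way to realize the "greedy HTN planner using the learned methods" the abstract evaluates.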
