Learning Macro-Actions for Arbitrary Planners and Domains

机译：学习针对任意计划者和领域的宏观行动

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Many complex domains and even larger problems in simple domains remain challenging in spite of the recent progress in planning. Besides developing and improving planning technologies, re-engineering a domain by utilising acquired knowledge opens up a potential avenue for further research. Moreover, macro-actions, when added to the domain as additional actions, provide a promising means by which to convey such knowledge. A macro-action, or macro in short, is a group of actions selected for application as a single choice. Most existing work on macros exploits properties explicitly specific to the planners or the domains. However, such properties are not likely to be common with arbitrary planners or domains. Therefore, a macro learning method that does not exploit any structural knowledge about planners or domains explicitly is of immense interest. This paper presents an offline macro learning method that works with arbitrarily chosen planners and domains. Given a planner, a domain, and a number of example problems, the learning method generates macros from plans of some of the given problems under the guidance of a genetic algorithm. It represents macros like regular actions, evaluates them individually by solving the remaining given problems, and suggests individual macros that are to be added to the domain permanently. Genetic algorithms are automatic learning methods that can capture inherent features of a system using no explicit knowledge about it. Our method thus does not strive to discover or utilise any structural properties specific to a planner or a domain.

机译：尽管最近在规划方面取得了进展，但许多复杂领域甚至是简单领域中更大的问题仍然具有挑战性。除了开发和改进计划技术外，通过利用获得的知识对领域进行重新设计还为进一步研究提供了潜在的途径。而且，当将宏动作作为附加动作添加到域中时，它提供了一种有前途的手段来传达这种知识。宏动作，或简称为宏，是一组作为单个选项选择应用的动作。大多数有关宏的现有工作都利用明确针对计划者或领域的属性。但是，此类属性在任意计划者或领域中不太可能共有。因此，没有显式利用有关计划者或领域的任何结构性知识的宏观学习方法引起了极大的兴趣。本文提出了一种离线宏学习方法，该方法可与任意选择的计划者和领域一起使用。给定一个计划者，一个领域和许多示例问题，该学习方法在遗传算法的指导下从某些给定问题的计划中生成宏。它代表类似于常规动作的宏，通过解决剩余的给定问题对它们进行单独评估，并建议将要永久添加到域中的单个宏。遗传算法是一种自动学习方法，无需使用明确的知识即可捕获系统的固有特征。因此，我们的方法并不努力去发现或利用特定于计划者或领域的任何结构特性。

著录项

来源
《International Conference on Automated Planning and Scheduling(ICAPS 2007); 2007;》|2007年|P.256-263|共8页
会议地点
作者
M.A. Hakim Newton; John Levine; Maria Fox; Derek Long;
展开▼
作者单位

Computer and Information Sciences University of Strathclyde Glasgow, United Kingdom;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 N12;
关键词
入库时间 2022-08-26 14:15:26

相似文献

外文文献
中文文献
专利

1. Marvin: A Heuristic Search Planner with Online Macro-Action Learning [J] . Coles A. I., Smith A. J. The Journal of Artificial Intelligence Research . 2007,第12期

机译：Marvin：具有在线宏动作学习的启发式搜索计划器
2. Marvin: A Heuristic Search Planner with Online Macro-Action Learning [J] . Andrew Coles, Amanda Smith The Journal of Artificial Intelligence Research . 2007,第0期

机译：Marvin：具有在线宏动作学习的启发式搜索计划器
3. Marvin: A Heuristic Search Planner with Online Macro-Action Learning [J] . A. I. Coles, A. J. Smith Journal of Automation, Mobile Robotics & Intelligent Systems . 2007,第1期

机译：Marvin：具有在线宏动作学习的启发式搜索计划器
4. Learning Macro-Actions for Arbitrary Planners and Domains [C] . M.A. Hakim Newton, John Levine, Maria Fox, International Conference on Automated Planning and Scheduling . 2007

机译：学习任意规划者和域名的宏观行动
5. Randomized Wavelets on Arbitrary Domains and Applications to Functional MRI Analysis. [D] . Ozkaya, Gorkem G. 2012

机译：任意域上的随机小波及其在功能性MRI分析中的应用。
6. An systematic review of e-learning outcomes in undergraduate dentalradiology curricula—levels of learning and implications for researchersand curriculum planners [O] . Michael G Botelho, Kalpana R Agrawal, Michael M Bornstein 2019

机译：对牙科学生电子学习成果的系统评价放射学课程—学习水平和对研究人员的影响和课程计划者
7. Marvin: A Heuristic Search Planner with Online Macro-Action Learning [O] . Coles, A. I., Smith, A. J. 2011

机译：marvin：具有在线宏观行动学习的启发式搜索计划器

Learning Macro-Actions for Arbitrary Planners and Domains

摘要

著录项

相似文献

相关主题

期刊订阅