【24h】

Learning Macro-Actions for Arbitrary Planners and Domains

机译:学习针对任意计划者和领域的宏观行动

获取原文
获取原文并翻译 | 示例

摘要

Many complex domains and even larger problems in simple domains remain challenging in spite of the recent progress in planning. Besides developing and improving planning technologies, re-engineering a domain by utilising acquired knowledge opens up a potential avenue for further research. Moreover, macro-actions, when added to the domain as additional actions, provide a promising means by which to convey such knowledge. A macro-action, or macro in short, is a group of actions selected for application as a single choice. Most existing work on macros exploits properties explicitly specific to the planners or the domains. However, such properties are not likely to be common with arbitrary planners or domains. Therefore, a macro learning method that does not exploit any structural knowledge about planners or domains explicitly is of immense interest. This paper presents an offline macro learning method that works with arbitrarily chosen planners and domains. Given a planner, a domain, and a number of example problems, the learning method generates macros from plans of some of the given problems under the guidance of a genetic algorithm. It represents macros like regular actions, evaluates them individually by solving the remaining given problems, and suggests individual macros that are to be added to the domain permanently. Genetic algorithms are automatic learning methods that can capture inherent features of a system using no explicit knowledge about it. Our method thus does not strive to discover or utilise any structural properties specific to a planner or a domain.
机译:尽管最近在规划方面取得了进展,但许多复杂领域甚至是简单领域中更大的问题仍然具有挑战性。除了开发和改进计划技术外,通过利用获得的知识对领域进行重新设计还为进一步研究提供了潜在的途径。而且,当将宏动作作为附加动作添加到域中时,它提供了一种有前途的手段来传达这种知识。宏动作,或简称为宏,是一组作为单个选项选择应用的动作。大多数有关宏的现有工作都利用明确针对计划者或领域的属性。但是,此类属性在任意计划者或领域中不太可能共有。因此,没有显式利用有关计划者或领域的任何结构性知识的宏观学习方法引起了极大的兴趣。本文提出了一种离线宏学习方法,该方法可与任意选择的计划者和领域一起使用。给定一个计划者,一个领域和许多示例问题,该学习方法在遗传算法的指导下从某些给定问题的计划中生成宏。它代表类似于常规动作的宏,通过解决剩余的给定问题对它们进行单独评估,并建议将要永久添加到域中的单个宏。遗传算法是一种自动学习方法,无需使用明确的知识即可捕获系统的固有特征。因此,我们的方法并不努力去发现或利用特定于计划者或领域的任何结构特性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号