首页> 外文学位 >Experimentation-driven learning of planning operators.
【24h】

Experimentation-driven learning of planning operators.

机译:实验驱动的规划操作员学习。

获取原文
获取原文并翻译 | 示例

摘要

Knowledge engineering for planning is expensive and the resulting knowledge can be imperfect. To autonomously learn a plan operator definition from environmental feedback, an operator learning system WISER explores an instantiated literal space using a breadth-first search technique. Each node of the search tree represents a world state, a unique subset of the instantiated literal space. A state at the root node is called a seed state. WISER can generate seed states with or without utilizing imperfect expert knowledge. WISER experiments with an operator at each node. A positive state, in which an operator can be successfully executed, constitutes initial preconditions of an operator. We analyze the number of required experiments as a function of the number of missing preconditions in a seed state. We introduce a naive domain assumption to test only a subset of the exponential state space. Since breadth-first search is expensive, WISER introduces two search techniques to reorder literals at each level of the search tree. We demonstrate performance improvement using the naive domain assumption and literal-ordering heuristics. Unlike other systems, WISER can model the effects on objects, which are not bound to any operator parameter, by introducing a variable. We show that a machine-generated definition of effects is often simpler in representation than expert-provided definitions. To refine imperfect operators when the internal expectation is not consistent with the external reality, WISER monitors and searches through an instantiated literal space during plan execution. Compared to some systems, WISER needs less monitoring and learns multiple missing literals without relying on a previous success case. Assuming that the system can transition to any user-specified state to perform experiments can be unrealistic. To overcome this shortcoming, we expand WISER to experiment with operators in a current state, monitor results, and experiment again from the naturally-generated new state. we present the empirical results showing that our new approach performs better than our previous approach.
机译:用于计划的知识工程非常昂贵,因此产生的知识可能不完善。为了从环境反馈中自动学习计划操作员定义,操作员学习系统WISER使用广度优先搜索技术探索实例化的文字空间。搜索树的每个节点表示一个世界状态,即实例化文字空间的唯一子集。根节点处的状态称为种子状态。 WISER可以使用或不使用不完善的专业知识来生成种子状态。 WISER在每个节点上与一个运算符进行实验。可以成功执行操作员的积极状态构成了操作员的初始前提。我们根据种子状态中缺少的前提条件的数量来分析所需实验的数量。我们引入一个朴素域假设来仅测试指数状态空间的一个子集。由于广度优先搜索价格昂贵,因此WISER引入了两种搜索技术来对搜索树的每个级别的文字进行重新排序。我们展示了使用朴素域假设和文字顺序试探法的性能改进。与其他系统不同,WISER可以通过引入变量来对不受任何操作员参数约束的对象的效果进行建模。我们表明,机器生成的效果定义通常比专家提供的定义更简单。为了在内部期望与外部现实不一致时完善不完善的算子,WISER在计划执行期间监视并搜索实例化的文字空间。与某些系统相比,WISER不需要进行任何监视,并且无需依赖先前的成功案例即可学习多个缺少的文字。假设系统可以转换到用户指定的任何状态以执行实验可能是不现实的。为克服此缺点,我们将WISER扩展为可以在当前状态下对操作员进行实验,监视结果并从自然生成的新状态下再次进行实验。我们提供的经验结果表明,我们的新方法比以前的方法表现更好。

著录项

  • 作者

    Tae, Kang Soo.;

  • 作者单位

    The University of Texas at Arlington.;

  • 授予单位 The University of Texas at Arlington.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1997
  • 页码 133 p.
  • 总页数 133
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号