首页> 外文会议>International Conference on Automated Planning and Scheduling >Cross-Domain Action-Model Acquisition for Planning Via Web Search
【24h】

Cross-Domain Action-Model Acquisition for Planning Via Web Search

机译:通过Web搜索规划跨域动作模型采集

获取原文

摘要

Applying learning techniques to acquire action models is an area of intense research interest. Most previous works in this area have assumed that there is a significant amount of training data available in a planning domain of interest, which we call target domain, where action models are to be learned. However, it is often difficult to acquire sufficient training data to ensure that the learned action models are of high quality. In this paper, we develop a novel approach to learning action models with limited training data in the target domain by transferring knowledge from related auxiliary or source domains. We assume that the action models in the source domains have already been created before, and seek to transfer as much of the available information from the source domains as possible to help our learning task. We first exploit a Web searching method to bridge the target and source domains, such that transferrable knowledge from source domains is identified. We then encode the transferred knowledge together with the available data from the target domain as constraints in a maximum satisfiability problem, and solve these constraints using a weighted MAX-SAT solver. We finally transform the solutions thus obtained into high-quality target-domain action models. We empirically show that our transfer-learning based framework is effective in several domains, including the International Planning Competition (IPC) domains and some synthetic domains.
机译:应用学习技术获取动作模型是一个激烈的研究兴趣领域。此领域的最先前的工作已经假定有大量的培训数据,其中有趣的计划领域可用,我们调用目标域,其中要学习动作模型。但是,往往很难获得足够的训练数据,以确保学习的动作模型具有高质量。在本文中,我们通过从相关辅助或源极域转移知识来开发一种新的培训数据的学习动作模型的方法。我们假设源域中的动作模型已经在之前创建,并寻求从源域中从源域传输可用信息以帮助我们的学习任务。我们首先利用Web搜索方法来桥接目标和源域,从而识别来自源域的可传输知识。然后,我们将传送的知识与来自目标域中的可用数据一起编码为最大可满足问题中的约束,并使用加权的MAX-SAT求解器来解决这些约束。我们最终将如此获得的解决方案转换为高质量的目标域动作模型。我们经常表明,我们的转移基于学习的框架在若干域中有效,包括国际规划竞争(IPC)域和一些合成领域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号