【24h】

Iterative Learning of Weighted Rule Sets for Greedy Search

机译:贪婪搜索的加权规则集的迭代学习

获取原文
获取原文并翻译 | 示例

摘要

Greedy search is commonly used in an attempt to generate solutions quickly at the expense of completeness and optimality. In this work, we consider learning sets of weighted action-selection rules for guiding greedy search with application to automated planning. We make two primary contributions over prior work on learning for greedy search. First, we introduce weighted sets of action-selection rules as a new form of control knowledge for greedy search. Prior work has shown the utility of action-selection rules for greedy search, but has treated the rules as hard constraints, resulting in brittleness. Our weighted rule sets allow multiple rules to vote, helping to improve robustness to noisy rules. Second, we give a new iterative learning algorithm for learning weighted rule sets based on RankBoost, an efficient boosting algorithm for ranking. Each iteration considers the actual performance of the current rule set and directs learning based on the observed search errors. This is in contrast to most prior approaches, which learn control knowledge independently of the search process. Our empirical results have shown significant promise for this approach in a number of domains.
机译:贪婪搜索通常用于尝试以完整性和最优性为代价快速生成解决方案。在这项工作中,我们考虑学习加权操作选择规则集,以指导贪婪搜索与自动计划的应用。对于先前的贪婪搜索学习工作,我们做出了两个主要贡献。首先,我们引入加权的行动选择规则集,作为贪婪搜索的一种新形式的控制知识。先前的工作已经显示了行动选择规则在贪婪搜索中的效用,但已将规则视为硬约束,从而导致了脆弱性。我们的加权规则集允许对多个规则进行投票,从而有助于提高对嘈杂规则的鲁棒性。其次,我们给出了一种新的迭代学习算法,用于基于RankBoost(一种高效的排名提升算法)来学习加权规则集。每次迭代都会考虑当前规则集的实际性能,并根据观察到的搜索错误指导学习。这与大多数现有方法相反,大多数现有方法独立于搜索过程来学习控制知识。我们的经验结果表明,该方法在许多领域都具有广阔的前景。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号