首页> 外文会议>International Conference on Automated Planning and Scheduling(ICAPS 2007); 2007; >Discovering Relational Domain Features for Probabilistic Planning
【24h】

Discovering Relational Domain Features for Probabilistic Planning

机译:发现关系域特征以进行概率规划

获取原文
获取原文并翻译 | 示例

摘要

In sequential decision-making problems formulated as Markov decision processes, state-value function approximation using domain features is a critical technique for scaling up the feasible problem size. We consider the problem of automatically finding useful domain features in problem domains that exhibit relational structure. Specifically we consider learning compact relational features without input from human expertise; we use neither expert decisions nor human domain knowledge beyond the basic domain definition. We propose a method to learn relational features for a linear value-function representation-numerically valued features are selected by their fit to the Bellman residual of the current value function and are automatically learned and added to the representation when needed. Starting with only a trivial feature in the value-function representation, our method finds useful value functions by combining feature learning with approximate value iteration. Empirical work presented here for Tetris and for probabilistic planning competition domains shows that our technique represents the state-of-the-art for both domain-independent feature learning and for stochastic planning in relational domains.
机译:在表述为马尔可夫决策过程的顺序决策问题中,使用领域特征进行状态值函数逼近是扩大可行问题规模的一项关键技术。我们考虑在显示关系结构的问题域中自动查找有用的域特征的问题。具体来说,我们考虑在没有人类专业知识的情况下学习紧凑的关系特征。在基本领域定义之外,我们既不使用专家决策也不使用人类领域知识。我们提出了一种学习线性值函数表示的关系特征的方法-通过将数字值特征拟合到当前值函数的Bellman残差来选择数字值特征,然后自动学习它们,并在需要时将其添加到表示中。我们的方法仅从值函数表示中的琐碎特征开始,通过将特征学习与近似值迭代相结合来找到有用的值函数。这里针对俄罗斯方块和概率规划竞争领域提出的经验工作表明,我们的技术代表了与领域无关的特征学习和关系领域随机规划的最新技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号