【24h】

Counting-MLNs: Learning Relational Structure for Decision Making

机译:MLN计数:学习决策的关系结构

获取原文

摘要

Many first-order probabilistic models can be represented much more compactly using aggregation operations such as counting. While traditional statistical relational representations share factors across sets of interchangeable random variables, representations that explicitly model aggregations also exploit interchange-ability of random variables within factors. This is especially useful in decision making settings, where an agent might need to reason about counts of the different types of objects it interacts with. Previous work on counting formulas in statistical relational representations has mostly focused on the problem of exact inference on an existing model. The problem of learning such models is largely unexplored. In this paper, we introduce Counting Markov Logic Networks (C-MLNs), an extension of Markov logic networks that can compactly represent complex counting formulas. We present a structure learning algorithm for C-MLNs; we apply this algorithm to the novel problem of generalizing natural language instructions, and to relational reinforcement learning in the Crossblock domain, in which standard MLN learning algorithms fail to find any useful structure. The C-MLN policies learned from natural language instructions are compact and intuitive, and, despite requiring no instructions on test games, win 20% more Crossblock games than a state-of-the-art algorithm for following natural language instructions.
机译:使用聚合操作(例如计数)可以更紧凑地表示许多一阶概率模型。尽管传统的统计关系表示法在可互换的随机变量集之间共享因子,但是对模型进行显式建模的表示法也利用了因子内随机变量的互换性。这在决策设置中特别有用,在该设置中,代理可能需要推理与之交互的不同类型对象的数量。先前关于统计关系表示中的公式计数的工作主要集中在对现有模型进行精确推断的问题上。学习此类模型的问题在很大程度上尚待探讨。在本文中,我们介绍了计数马尔可夫逻辑网络(C-MLN),它是马尔可夫逻辑网络的扩展,可以紧凑地表示复杂的计数公式。我们提出了一种针对C-MLN的结构学习算法;我们将此算法应用于广义自然语言指令的新颖性问题,并应用于Crossblock域中的关系强化学习,在该学习中,标准MLN学习算法无法找到任何有用的结构。从自然语言指令中学到的C-MLN策略紧凑而直观,尽管不需要测试游戏上的指令,但与遵循自然语言指令的最新算法相比,Crossblock游戏的胜率要高出20%。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号