...
首页> 外文期刊>Artificial intelligence >Compact and efficient encodings for planning in factored state and action spaces with learned Binarized Neural Network transition models
【24h】

Compact and efficient encodings for planning in factored state and action spaces with learned Binarized Neural Network transition models

机译:具有学习二金属化神经网络转换模型的考核状态和行动空间的规划紧凑和高效的编码

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we leverage the efficiency of Binarized Neural Networks (BNNs) to learn complex state transition models of planning domains with discretized factored state and action spaces. In order to directly exploit this transition structure for planning, we present two novel compilations of the learned factored planning problem with BNNs based on reductions to Weighted Partial Maximum Boolean Satisfiability (FD-SAT-Plan+) as well as Binary Linear Programming (FD-BLP-Plan+). Theoretically, we show that our SAT-based Bi-Directional Neuron Activation Encoding is asymptotically the most compact encoding relative to the current literature and supports Unit Propagation (UP) - an important property that facilitates efficiency in SAT solvers. Experimentally, we validate the computational efficiency of our Bi-Directional Neuron Activation Encoding in comparison to an existing neuron activation encoding and demonstrate the ability to learn complex transition models with BNNs. We test the runtime efficiency of both FD-SAT-Plan+ and FD-BLP-Plan+ on the learned factored planning problem showing that FD-SAT-Plan+ scales better with increasing BNN size and complexity. Finally, we present a finite-time incremental constraint generation algorithm based on generalized landmark constraints to improve the planning accuracy of our encodings through simulated or real-world interaction.
机译:在本文中,我们利用二值化神经网络(BNN)的效率来学习规划域的复杂状态转换模型,具有离散的因子状态和行动空间。为了直接利用这种过渡结构进行规划,我们提出了基于对加权部分最大布尔可满足(FD-SAT-Plan +)以及二进制线性编程(FD-BLP)的加权部分最大布尔满足性(FD-BLP -plan +)。从理论上讲,我们基于SAT的双向神经元激活编码是相对于当前文献的最紧凑的编码,支持单元传播(向上) - 一种重要的属性,便于SAT溶剂中的效率。实验地,与现有的神经元激活编码相比,我们验证了我们的双向神经元激活编码的计算效率,并证明了使用BNN学习复杂转换模型的能力。我们测试FD-SAT计划+和FD-BLP-PLAN +的运行时间效率在学习的因素规划问题上,表明FD-SAT计划+尺度更好,随着BNN大小和复杂性的增加。最后,我们提出了一种基于广义地标约束的有限时间增量约束生成算法,通过模拟或现实世界互动来提高我们的编码的规划准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号