Compact and efficient encodings for planning in factored state and action spaces with learned Binarized Neural Network transition models

Buser Say; Scott Sanner

Abstract

In this paper, we leverage the efficiency of Binarized Neural Networks (BNNs) to learn complex state transition models of planning domains with discretized factored state and action spaces. In order to directly exploit this transition structure for planning, we present two novel compilations of the learned factored planning problem with BNNs based on reductions to Weighted Partial Maximum Boolean Satisfiability (FD-SAT-Plan+) as well as Binary Linear Programming (FD-BLP-Plan+). Theoretically, we show that our SAT-based Bi-Directional Neuron Activation Encoding is asymptotically the most compact encoding relative to the current literature and supports Unit Propagation (UP) - an important property that facilitates efficiency in SAT solvers. Experimentally, we validate the computational efficiency of our Bi-Directional Neuron Activation Encoding in comparison to an existing neuron activation encoding and demonstrate the ability to learn complex transition models with BNNs. We test the runtime efficiency of both FD-SAT-Plan+ and FD-BLP-Plan+ on the learned factored planning problem showing that FD-SAT-Plan+ scales better with increasing BNN size and complexity. Finally, we present a finite-time incremental constraint generation algorithm based on generalized landmark constraints to improve the planning accuracy of our encodings through simulated or real-world interaction.
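As a rough illustration of why a binarized neuron lends itself to Boolean encodings such as the SAT and BLP compilations mentioned above, the sketch below shows a neuron with inputs and weights in {-1, +1} rewritten as a counting (cardinality) test. This is a simplified sketch under stated assumptions, not the paper's Bi-Directional Neuron Activation Encoding: it assumes an integer bias, omits batch normalization, and the function names (`neuron_sign`, `neuron_cardinality`, `forms_agree`) are hypothetical.

```python
from itertools import product


def neuron_sign(weights, inputs, bias=0):
    """Binarized neuron: +1 if the weighted sum plus bias is non-negative, else -1."""
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if total >= 0 else -1


def neuron_cardinality(weights, inputs, bias=0):
    """The same neuron as a counting test.

    With weights and inputs in {-1, +1}, the weighted sum equals
    2 * agree - n, where agree counts positions with inputs[i] == weights[i].
    So the neuron fires exactly when agree >= ceil((n - bias) / 2).
    """
    n = len(weights)
    agree = sum(1 for w, x in zip(weights, inputs) if w == x)
    threshold = (n - bias + 1) // 2  # integer ceil((n - bias) / 2)
    return 1 if agree >= threshold else -1


def forms_agree(weights, bias=0):
    """Brute-force check that both forms match on every {-1, +1} input vector."""
    return all(
        neuron_sign(weights, xs, bias) == neuron_cardinality(weights, xs, bias)
        for xs in product((-1, 1), repeat=len(weights))
    )
```

For example, `forms_agree([1, -1, 1])` checks all eight input vectors of a three-input neuron. An "at least k of n literals are true" condition of this kind is a standard cardinality constraint, which can be stated as clauses for a SAT solver or as a linear inequality over 0/1 variables.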

Bibliographic details

Source: Artificial Intelligence, 2020, Issue 8, pp. 103291.1-103291.21 (21 pages)
Authors: Buser Say; Scott Sanner
Author affiliations:

Department of Mechanical & Industrial Engineering, University of Toronto, Canada; Vector Institute, Canada; Faculty of Information Technology, Monash University, Australia

Department of Mechanical & Industrial Engineering, University of Toronto, Canada; Vector Institute, Canada

Original format: PDF
Language of text: English
Keywords: Data-driven planning; Binarized Neural Networks; Weighted Partial Maximum Boolean Satisfiability; Binary Linear Programming

Similar documents

1. An efficient unconstrained facial expression recognition algorithm based on Stack Binarized Auto-encoders and Binarized Neural Networks [J]. Sun Wenyun, Zhao Haitao, Jin Zhong. Neurocomputing, 2017, Issue deca6
2. Scalable Planning with Deep Neural Network Learned Transition Models [J]. Ga Wu, Buser Say, Scott Sanner. The Journal of Artificial Intelligence Research, 2020, Issue 7
3. CDbin: Compact Discriminative Binary Descriptor Learned With Efficient Neural Network [J]. Ye Jianming, Zhang Shiliang, Huang Tiejun. IEEE Transactions on Circuits and Systems for Video Technology, 2020, Issue 3
4. Planning in Factored State and Action Spaces with Learned Binarized Neural Network Transition Models [C]. Buser Say, Scott Sanner. International Joint Conference on Artificial Intelligence, 2018
5. Optimal Planning with Learned Neural Network Transition Models [D]. Say, Buser. 2020
6. An Efficient and Perceptually Motivated Auditory Neural Encoding and Decoding Algorithm for Spiking Neural Networks [O]. Zihan Pan, Yansong Chua, Jibin Wu. 2019
7. Compact and efficient encodings for planning in factored state and action spaces with learned Binarized Neural Network transition models [O]. Buser Say, Scott Sanner. 2020
