首页> 外文会议>AAAI Conference on Artificial Intelligence >Deep Bayesian Nonparametric Learning of Rules and Plans from Demonstrations with a Learned Automaton Prior
【24h】

Deep Bayesian Nonparametric Learning of Rules and Plans from Demonstrations with a Learned Automaton Prior

机译:Deep Bayesian非参加规则和计划从先前与学习自动化的示威活动

获取原文

摘要

We introduce a method to learn imitative policies from expert demonstrations that are interpretable and manipulable. We achieve interpretability by modeling the interactions between high-level actions as an automaton with connections to formal logic. We achieve manipulability by integrating this automaton into planning, so that changes to the automaton have predictable effects on the learned behavior. These qualities allow a human user to first understand what the model has learned, and then either correct the learned behavior or zero-shot generalize to new, similar tasks. We build upon previous work by no longer requiring additional supervised information which is hard to collect in practice. We achieve this by using a deep Bayesian nonparametric hierarchical model. We test our model on several domains and also show results for a real-world implementation on a mobile robotic arm platform.
机译:我们介绍了一种从可解释和可操纵的专家演示学习模仿政策的方法。 我们通过将高级动作与具有与正式逻辑连接的自动化的自动化建模的相互作用来实现解释性。 我们通过将该自动机集成到规划中来实现可操纵性,因此对自动机构的变化对学习行为具有可预测的影响。 这些品质允许人类用户首先了解模型已经学习了什么,然后纠正了学习行为或零拍摄的概括到新的类似任务。 我们基于以前的工作,不再需要额外的监督信息,这很难在实践中收集。 我们通过使用深贝叶斯非参数分层模型来实现这一目标。 我们在几个域上测试我们的模型,并在移动机器人臂平台上显示了真实世界的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号