首页> 外文会议>AAAI Conference on Artificial Intelligence >Deep Bayesian Nonparametric Learning of Rules and Plans from Demonstrations with a Learned Automaton Prior

【24h】

Deep Bayesian Nonparametric Learning of Rules and Plans from Demonstrations with a Learned Automaton Prior

机译：Deep Bayesian非参加规则和计划从先前与学习自动化的示威活动

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We introduce a method to learn imitative policies from expert demonstrations that are interpretable and manipulable. We achieve interpretability by modeling the interactions between high-level actions as an automaton with connections to formal logic. We achieve manipulability by integrating this automaton into planning, so that changes to the automaton have predictable effects on the learned behavior. These qualities allow a human user to first understand what the model has learned, and then either correct the learned behavior or zero-shot generalize to new, similar tasks. We build upon previous work by no longer requiring additional supervised information which is hard to collect in practice. We achieve this by using a deep Bayesian nonparametric hierarchical model. We test our model on several domains and also show results for a real-world implementation on a mobile robotic arm platform.

机译：我们介绍了一种从可解释和可操纵的专家演示学习模仿政策的方法。我们通过将高级动作与具有与正式逻辑连接的自动化的自动化建模的相互作用来实现解释性。我们通过将该自动机集成到规划中来实现可操纵性，因此对自动机构的变化对学习行为具有可预测的影响。这些品质允许人类用户首先了解模型已经学习了什么，然后纠正了学习行为或零拍摄的概括到新的类似任务。我们基于以前的工作，不再需要额外的监督信息，这很难在实践中收集。我们通过使用深贝叶斯非参数分层模型来实现这一目标。我们在几个域上测试我们的模型，并在移动机器人臂平台上显示了真实世界的结果。

著录项

来源
《AAAI Conference on Artificial Intelligence》|2020年|9766-10433p|共9页
会议地点
作者
Brandon Araki; Kiran Vodrahalli; Thomas Leech; Cristian-Ioan Vasile; Mark Donahue; Daniela Rus;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Bayesian Nonparametric Reward Learning From Demonstration [J] . Michini Bernard, Walsh Thomas J., Agha-Mohammadi Ali-Akbar, Robotics, IEEE Transactions on . 2015,第2期

机译：示范贝叶斯非参数奖励学习
2. A nonparametric Bayesian approach toward robot learning by demonstration [J] . Sotirios P. Chatzis, Dimitrios Korkinof, Yiannis Demiris Robotics and Autonomous Systems . 2012,第6期

机译：通过演示进行机器人学习的非参数贝叶斯方法
3. Capturing and Understanding Workers' Activities in Far-Field Surveillance Videos with Deep Action Recognition and Bayesian Nonparametric Learning [J] . Luo Xiaochun, Li Heng, Yang Xincong, Computer-Aided Civil and Infrastructure Engineering . 2019,第4期

机译：通过深度动作识别和贝叶斯非参数学习在远距离监视视频中捕捉和理解工人的活动
4. Deep Bayesian Nonparametric Learning of Rules and Plans from Demonstrations with a Learned Automaton Prior [C] . Brandon Araki, Kiran Vodrahalli, Thomas Leech, AAAI Conference on Artificial Intelligence . 2020

机译：Deep Bayesian非参加规则和计划从先前与学习自动化的示威活动
5. Composing Deep Learning and Bayesian Nonparametric Methods [D] . Zhang, Aonan . 2019

机译：撰写深层学习和贝叶斯非参数方法
6. Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery [O] . Jeya Balaji Balasubramanian, Vanathi Gopalakrishnan 2018

机译：用于知识集成生物标记发现的贝叶斯规则学习的可调结构先验
7. 1Bayesian Nonparametric Reward Learning from Demonstration [O] . Bernard Michini, Thomas J. Walsh, Ali-akbar Agha-mohammadi, 2015

机译：基于示范的1Bayesian非参数奖励学习

Deep Bayesian Nonparametric Learning of Rules and Plans from Demonstrations with a Learned Automaton Prior

摘要

著录项

相似文献

相关主题

期刊订阅