首页> 外文会议>IEEE International Conference on Robotics and Automation >Deep compositional robotic planners that follow natural language commands
【24h】

Deep compositional robotic planners that follow natural language commands

机译:遵循自然语言命令的深度合成机器人计划者

获取原文

摘要

We demonstrate how a sampling-based robotic planner can be augmented to learn to understand a sequence of natural language commands in a continuous configuration space to move and manipulate objects. Our approach combines a deep network structured according to the parse of a complex command that includes objects, verbs, spatial relations, and attributes, with a sampling-based planner, RRT. A recurrent hierarchical deep network controls how the planner explores the environment, determines when a planned path is likely to achieve a goal, and estimates the confidence of each move to trade off exploitation and exploration between the network and the planner. Planners are designed to have near-optimal behavior when information about the task is missing, while networks learn to exploit observations which are available from the environment, making the two naturally complementary. Combining the two enables generalization to new maps, new kinds of obstacles, and more complex sentences that do not occur in the training set. Little data is required to train the model despite it jointly acquiring a CNN that extracts features from the environment as it learns the meanings of words. The model provides a level of interpretability through the use of attention maps allowing users to see its reasoning steps despite being an end-to-end model. This end-to-end model allows robots to learn to follow natural language commands in challenging continuous environments.
机译:我们展示了如何增强基于采样的机器人规划器,以便学会了解在连续配置空间中的一系列自然语言命令来移动和操纵对象。我们的方法将根据复杂命令的解析组合了一个结构化的,该命令包括对象,动词,空间关系和属性,具有基于采样的策划器,RRT。经常性的等级深度网络控制计划者如何探讨环境,确定计划的路径可能实现目标何时实现目标,并估计每个举措的置信度在网络和策划者之间进行剥削和探索。策划人员旨在在缺少任务的信息时具有近最佳行为,而网络学会用于利用环境中可用的观测,使得两个自然互补。组合这两个使概括为新地图,新的障碍物,以及在训练集中不会发生的更复杂的句子。尽管它共同获取了从环境中提取的CNN,但培训模型需要训练该模型的少量数据是因为它学会了单词的含义。该模型通过使用注意力映射提供了一种可解释性,允许用户尽管是端到端模型,但是尽管是结束模型,但是尽管是结束模型,请查看其推理步骤。这种端到端模型允许机器人学习在挑战的连续环境中遵循自然语言命令。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号