...
首页> 外文期刊>JMLR: Workshop and Conference Proceedings >OCEAN Online Task Inference for Compositional Tasks with Context Adaptation
【24h】

OCEAN Online Task Inference for Compositional Tasks with Context Adaptation

机译:海洋在线任务推论,具有语境适应的组成任务

获取原文
           

摘要

Real-world tasks often exhibit a compositional structure that contains a sequence of simpler sub-tasks. For instance, opening a door requires reaching, grasping, rotating, and pulling the door knob. Such compositional tasks require an agent to reason about the sub-task at hand while orchestrating global behavior accordingly. This can be cast as an online task inference problem, where the current task identity, represented by a context variable, is estimated from the agent’s past experiences with probabilistic inference. Previous approaches have employed simple latent distributions, e.g., Gaussian, to model a single context for the entire task. However, this formulation lacks the expressiveness to capture the composition and transition of the sub-tasks. We propose a variational inference framework OCEAN to perform online task inference for compositional tasks. OCEAN models global and local context variables in a joint latent space, where the global variables represent a mixture of sub-tasks required for the task, while the local variables capture the transitions between the sub-tasks. Our framework supports flexible latent distributions based on prior knowledge of the task structure and can be trained in an unsupervised manner. Experimental results show that OCEAN provides more effective task inference with sequential context adaptation and thus leads to a performance boost on complex, multi-stage tasks.
机译:现实世界的任务通常表现出一种组成结构,其中包含一系列更简单的子任务。例如,打开门需要伸手,抓住,旋转和拉动门旋钮。这种组成任务需要一个代理商在编排全球行为的同时在手头上进行处理,同时协调。这可以作为在线任务推断问题,其中,当前由上下文变量表示的当前任务标识,估计代理的概率推断的过去的经历。以前的方法已经采用了简单的潜在分布,例如高斯,为整个任务建模单个上下文。然而,这种配方缺乏捕捉子任务的组成和转换的表现力。我们提出了一个变分的推理框架海洋,在线任务推理进行组建任务。海洋模型在联合潜在空间中的全局和本地上下文变量,其中全局变量代表任务所需的子任务的混合,而局部变量捕获子任务之间的转换。我们的框架基于任务结构的先验知识支持灵活的潜在分布,可以以无人监督的方式培训。实验结果表明,海洋提供了更有效的任务推断,顺序上下文适应,从而导致复杂多级任务的性能提升。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号