【24h】

Learning Options for an MDP from Demonstrations

机译:从演示中学习MDP的选项

获取原文
获取外文期刊封面目录资料

摘要

The options framework provides a foundation to use hierarchical actions in reinforcement learning. An agent using options, along with primitive actions, at any point in time can decide to perform a macro-action made out of many primitive actions rather than a primitive action. Such macro-actions can be hand-crafted or learned. There has been previous work on learning them by exploring the environment. Here we take a different perspective and present an approach to learn options from a set of experts demonstrations. Empirical results are also presented in a similar setting to the one used in other works in this area.
机译:选项框架为在强化学习中使用分层操作提供了基础。使用选项的代理以及原始操作可以在任何时间点都可以决定执行由许多原始操作而不是原始操作的宏操作。这种宏动作可以手工制作或学习。以前通过探索环境来学习它们。在这里,我们采取了不同的视角,并提出了一种从一组专家演示中学习选项的方法。经验结果也呈现在该区域其他作品中使用的类似环境中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号