首页> 外文会议> >Automatic State Abstraction from Demonstration
【24h】

Automatic State Abstraction from Demonstration

机译:演示中的自动状态抽象

获取原文

摘要

Learning from Demonstration (LfD) is a popular technique for building decision-making agents from human help. Traditional LfD methods use demonstrations as training examples for supervised learning, but complex tasks can require more examples than is practical to obtain. We present Abstraction from Demonstration (AfD), a novel form of LfD that uses demonstrations to infer state abstractions and reinforcement learning (RL) methods in those abstract state spaces to build a policy. Empirical results show that AfD is greater than an order of magnitude more sample efficient than just using demonstrations as training examples, and exponentially faster than RL alone.
机译:从演示中学习(LfD)是一种流行的技术,可以通过人的帮助来建立决策者。传统的LfD方法使用演示作为监督学习的训练示例,但是复杂的任务可能需要比实际获得更多的示例。我们介绍了演示的抽象形式(AfD),它是LfD的一种新颖形式,它使用演示来推断那些抽象状态空间中的状态抽象和强化学习(RL)方法以构建策略。实验结果表明,与仅使用演示作为训练示例相比,AfD的采样效率要高出一个数量级,并且比单独的RL快几倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号