首页> 美国政府科技报告 >First Results with Instance-Based State Identification for Reinforcement Learning
【24h】

First Results with Instance-Based State Identification for Reinforcement Learning

机译:基于实例的状态识别的强化学习的第一个结果

获取原文

摘要

When a reinforcement learning agent's next course of action depends oninformation that is hidden from the sensors because of problems such as occlusion, restricted range, bounded field of view and limited attention, we say the agent suffers from the hidden state problem. State identification techniques use history information to uncover hidden state. Previous approaches to encoding history include: finite state machines, recurrent neural networks, and genetic programming with indexed memory. A chief disadvantage of all these techniques is their long training time. This report presents instance-based state identification, a new approach to reinforcement learning with state identification that learns with much fewer training steps. Noting that learning with history and learning in continuous spaces both share the property that they begin without knowing the granularity of the state space, the approach applies instance-based (or memory-based) learning to history sequences-instead of recording instances in a continuous geometrical space, we record instances in action-perception-reward sequence space. The first implementation of this approach, called Nearest Sequence Memory, learns with an order of magnitude fewer steps than several previous approaches.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号