首页> 外文会议>Brazilian Symposium on Neural Networks >Speeding up autonomous learning by using state-independent option policies ant termination improvement
【24h】

Speeding up autonomous learning by using state-independent option policies ant termination improvement

机译:通过使用国家无关的选项策略蚂蚁终止改进来加速自主学习

获取原文

摘要

In Reinforcement Learning applications such as autonomous robot navigation, the use of options (macro-operators) instead of low level actions has been reported to produce learning speedup due to a more agressive exploration of the state space. In this paper we present an evaluation of the use of option policies O{sub}S. Each option policy in this framework is a fixed sequence of actions, depending exclusively on the state in which the option is initiated. This contrasts with option policies O{sub}∏, more common in the literature and that correspond to action sequences that depend on the states visited during the execution of the options. One of our goals was to analyse the effects of a variation of the action sequence length for O{sub}S policies. The main contribution of the paper, however, is a study on the use of a Termination Improvement technique which allows for the abortion of option execution if a more promissing one is found. Experimental results show that Termination Improvement for O{sub}S options, whose benefits had already been reported for O{sub}∏ options, can be much more effective - due to its adaptation of the size of the action sequence depending on the state where the option is initiated - than indiscriminately augmenting the option size in order to increase exploration of the state space.
机译:在强化学习的应用,如自主机器人导航,使用的选项(宏观运营商),而不是低层次的行动已报产生学习加速由于状态空间的更积极的探索。在本文中,我们目前使用的选项政策-O {}子S的的评估。在这个框架中的每个选项的政策行动,专门在该选项启动的状态取决于固定顺序。这与选择政策-O {}子Π,多见于文学和对应于依赖的选项执行期间参观了美国的行动序列。我们的目标之一是分析动作序列长度为-O {}子的政策变化的影响。本文的主要贡献,但是,在使用终止改进技术,它允许选项执行的堕胎是一种更promissing一个被发现的研究。视的状态下,由于其动作顺序的大小适应 - 实验结果显示Ø{副} s ^期权,其收益已经报道-O {副}Π选项,终止改进,可以更有效选项开始 - 不是为了增加国家的空间探测胡乱扩充选项大小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号