首页> 外文会议>German conference on multiagent system technologies >Employing Automatic Temporal Abstractions to Accelerate Utile Suffix Memory Algorithm
【24h】

Employing Automatic Temporal Abstractions to Accelerate Utile Suffix Memory Algorithm

机译:利用自动时间抽象来加速Utile后缀存储算法

获取原文

摘要

The main objective of the memory based reinforcement learning algorithms for hidden state problems is to overcome the state aliasing issue using a form of short term memory during learning. Extended sequence tree method, on the other hand, is a sequence based automated temporal abstraction mechanism that can be appended to a reinforcement learning algorithm. Assuming a fully observable problem setting, it tries to find useful sub-policies in solution space that can be reused as timed actions, providing significant savings in terms of learning time. This paper presents a way to expand a well known memory based model-free reinforcement learning algorithm, namely Utile Suffix Memory, by using a modified version of extended sequence tree method. By this way, learning speed of the algorithm is increased under certain conditions. Enhancement is shown empirically via experimentation on some benchmark problems.
机译:基于存储器的用于隐藏状态问题的强化学习算法的主要目标是在学习过程中使用短期记忆的形式来克服状态混叠问题。另一方面,扩展序列树方法是一种基于序列的自动时间抽象机制,可以附加到强化学习算法中。假设问题的解决方案是完全可观察的,它会尝试在解决方案空间中找到有用的子策略,这些策略可以作为定时操作重用,从而在学习时间方面节省了很多时间。本文提出了一种使用扩展序列树方法的改进版本来扩展基于众所周知的基于内存的无模型增强学习算法的方法,即Utile Suffix Memory。通过这种方式,在一定条件下提高了算法的学习速度。通过对一些基准问题进行实验,经验表明了这种增强。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号